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TITLE OF THE INVENTION 

ASSAYS FOR NUCLEAR RECEPTOR LIGANDS USING FRET 

5 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional 
Application No. 60/061,385, filed 10/7/97, the contents of which are 
incorporated herein by reference in their entirety. 

10 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
Not applicable. 

REFERENCE TO MICROFICHE APPENDIX 
15 Not applicable. 

FIELD OF THE INVENTION 

This invention relates to methods of identifying novel 
agonists and antagonists of nuclear receptors utilizing the agonist- 
20 dependent interaction of such receptors with CREB-binding protein 

(CBP) or other nuclear receptor co-activators in which this interaction is 
detected by fluorescence resonance energy transfer. 

BACKGROUND OF THE INVENTION 

25 Nuclear receptors are a superfamily of ligand-activated 

transcription factors that bind as homodimers or heterodimers to their 
cognate DNA elements in gene promoters. The superfamily, with more 
than 150 members, can be divided into subfamilies (e.g. the steroid, 
retinoid, thyroid hormone, and peroxisome proliferator-activated 

30 [PPAR] subfamilies). Each subfamily may consist of several members 
which are encoded by individual genes (e.g. PPARa, PPARy, and 
PPAR5). In addition, alternative mRNA splicing can result in more 
than one isoform of these genes as in the case of specific PPARs (e.g. 
PPARyl and PPAR72). The nuclear receptor superfamily is involved in 

35 a wide variety of physiological functions in mammalian cells: e.g., 

differentiation, proliferation, and metabolic homeostasis. Dysfunction 
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or altered expression of specific nuclear receptors has been found to be 
involved in disease pathogenesis. 

The PPAR subfamily of nuclear receptors consists of three 
members: PPARa, PPARy, and PPAR5. PPARa is highly expressed in 

5 liver and kidney. Activation of PPARa by peroxisome proliferators 
(including hypolipidimic reagents such as fibrates) or medium and 
long-chain fatty acids is responsible for the induction of acyl-CoA 
oxidase and hydratase-dehydrogenase (enzymes required for 
peroxisomal P-oxidation), as well as cytochrome P450 4A6 (an enzyme 

10 required for fatty acid co-hydroxylase). Thus, PPARa has an important 
role in the regulation of lipid metabolism and is part of the mechanism 
through which hypolipidimic compounds such as fibrates exert their 
effects, PPARy is predominantly expressed in adipose tissue, Recently, 
a prostaglandin J2 metabolite, 15-Deoxy-D12,14-prostaglandin J2, has 

15 been identified as a potential physiological ligand of PPARy. Both 15- 
Deoxy-D12,14-prostaglandin J2 treatment of preadipocytes or retroviral 
expression of PPARy2 in fibroblasts induced adipocyte differentiation, 
demonstrating the role of PPARy in adipocyte differentiation and lipid 
storage. The demonstration that anti-diabetic and lipid-lowering 

20 insulin sensitizing compounds known as thiazolidinediones are high 
affinity ligands for PPARy suggests a broad therapeutic role for PPARy 
ligands in the treatment of diabetes and disorders associated with 
insulin resistance {e.g. obesity and cardiovascular disease). 

Nuclear receptor proteins contain a central DNA binding 

25 domain (DBD) and a COOH- terminal ligand binding domain (LBD). The 
DBD is composed of two highly conserved zinc fingers that target the 
receptor to specific promoter/enhancer DNA sequences known as 
hormone response elements (HREs). The LBD is about 200-300 amino 
acids in length and is less well conserved than the DBD. There are at 

30 least three functions for the LBD: dimerization, ligand binding, and 
transactivation. The transactivation function can be viewed as a 
molecular switch between a transcriptionally inactive and a 
transcriptionally active state of the receptor. Binding of a ligand which 
is an agonist flips the switch from the inactive state to the active state. 

35 The COOH-terminal portion of the LBD contains an activation function 
domain (AF2) that is required for the switch. 
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The ligand-induced nuclear receptor molecular switch is 
mediated through interactions with members of a family of nuclear 
receptor co-activators {e.g., CBP/p300, SRC-l/NcoA-1, TIF2/GRIP- 
l/NcoA-2, and p/CIP). Upon binding of agonist to its cognate receptor 

5 LBD, a conformational change in the receptor protein creates a co- 
activator binding surface and results in recruitment of co-activator(s) to 
the receptor and subsequent transcriptional activation. The binding of 
antagonist ligands to nuclear receptors will not induce the required 
conformational change and prevents recruitment of co-activator and 

10 subsequent induction of transcription. The co-activators CREB-binding 
protein (CBP) and p300 are two closely related proteins that were 
originally discovered by virtue of their ability to interact with the 
transcription factor CREB. These two proteins share extensive amino 
acid sequence homology. CBP can form a bridge between nuclear 

15 receptors and the basic transcriptional machinery (Kamei et al., 1996, 
Cell 85:403-414; Chakravarti et aL, 1996, Nature 383:99-103; Hanstein et 
al., 1996, Proc. Natl. Acad. Sci. USA 93:11540-11545; Heery et al., 1997, 
Nature 387:733-736). CBP also contains intrinsic histone 
acetyltransferase activity which could result in local chromatin 

20 rearrangement and further activation of transcription. Ligand- and 
AF2-dependent interaction between certain nuclear receptors and CBP 
has been demonstrated in in vitro pull down assays and far-western 
assays. This interaction is both necessary and sufficient for the 
transcriptional activation that is mediated by these nuclear receptors. 

25 Thus, an AF2 mutant of the estrogen receptor (ER) which abolishes the 
transcriptonal function of the receptor is incapable of interacting with 
CBP. 

The N-termini of CBP and p300 have been shown to interact 
with the ligand-binding domains of some nuclear receptors (Kamei et 
30 al., 1996, Cell 85:403-414, hereinafter "Kamei"), Kamei was able to 

demonstrate direct interaction of CBP and p300 with nuclear receptors 
by several different methods: 

(1) Kamei produced GST fusion proteins of the first 100 
amino acids of the N- terminus of CBP, These fusion proteins were run 
35 out on a polyacrylamide gel, transferred to a membrane, and the 
membrane was exposed to 32p -labeled ligand-binding domains of 

-3- 
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nuclear receptors. In the presence of ligand, a specific binding 
interaction between the CBP and nuclear receptor fragments was 
detected in that the 32p-l a beled ligand-binding domains were observed to 
bind to the bands on the membrane containing the GST-CBP fusion 
5 proteins. 

(2) Kamei also utilized the yeast two-hybrid system. The 
ligand-binding domain of the nuclear receptor fused to the DNA-binding 
domain of the LexA protein was used as bait. The amino terminal 
domain of CBP fused to the gal4 transactivation domain was used as 

10 prey. In the presence of ligand, a specific binding interaction (occurring 
in vivo, i.e., within the yeast) was observed between the CBP and nuclear 
receptor fragments. 

(3) Kamei observed ligand-induced binding between CBP 
and nuclear receptors via a gel-shift assay. This assay is based on the 

15 observation that, in the presence of ligand, nuclear receptors will bind to 
oligonucleotides containing their target recognition sequence. Such 
binding results in the formation of a nuclear receptor-ligand- 
oligonucleotide complex having a higher molecular weight than the 
oligonucleotide alone. This difference in molecular weight is detected 

20 via a shift in position of the 32p-l a beled oligonucleotide when it is run out 
on a polyacrylamide gel. Kamei found that a fragment of CBP (the N- 
terminal 100 amino acids) was capable of binding to the nuclear 
receptor-ligand-oligonucleotide complex and shifting the complex's 
position on the gel to an even higher molecular weight. 

25 (4) Kamei was able to co-immunoprecipitate CBP using 

antibodies to nuclear receptors in extracts from a variety of cells in the 
presence of ligand. 

(5) By the use of transcriptional activation assays, Kamei 
was able to demonstrate that nuclear receptors and CBP interact in a 

30 functional manner. Such transcriptional activation assays can indicate 
that two proteins are involved in a pathway that results in 
transcriptional activation but these assays do not prove that the 
interaction between the proteins is one of direct binding. 

By the above-described methods, Kamei was able to 
35 demonstrate specific binding interactions between CBP and the retinoic 
acid receptor (RAR), glucocorticoid receptor (GR), thyroid hormone 
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receptor (T3R), and retinoid X receptor (RXR). Kamei also demonstrated 
specific binding between the N-terminus of p300 and RAR. However, 
Kamei did not demonstrate specific binding between CBP, p300, or any 
other nuclear receptor co-activators and PPARs. 
5 What is striking about the methods used by Kamei is their 

extremely laborious and time consuming nature. Such methods 
involve, among other things, the construction of fusion proteins, the 
preparation of 32p-l a beled proteins, the construction of specialized 
expression vectors for the yeast two-hybrid assay and the transcriptional 
10 activation assays, the running of many gels, and the raising of 

antibodies. Most of these assays take days to carry out and preparing the 
reagents needed to carry them out may take weeks. Because of the 
complicated reagents that are involved in these assays and the time 
needed to prepare and run the assays, these assays tend to be costly. 
15 Investigators other than Kamei who have studied the interaction 

between nuclear receptors and CBP have also been forced to rely on such 
cumbersome methods (see, e.g., Chakravarti et al., 1996, Nature 383:99- 
103; Hanstein et al., 1996, Proc. Natl. Acad. Sri. USA 93:11540-11545; 
Heery et al., 1997, Nature 387:733-736). 
20 Kamei did not use the above-described methods to identify 

novel agonists or antagonists of nuclear receptors. The focus of Kamei 
was not on agonists or antagonists, but rather on the interaction 
between nuclear receptors and CBP. Although modifying the methods 
of Kamei to identify agonists or antagonists might be possible, such 
25 methods would suffer from serious disadvantages. This is because, as 
discussed above, all of the assays employed by Kamei to study the 
interaction of CBP and p300 with nuclear receptors are very laborious, 
slow, and costly. Given the therapeutic importance of steroid hormones 
such as estrogen, Cortisol, progesterone, and other nuclear receptor 
30 agonists such as thyroid hormone and antidiabetic thiazolidinedione 

compounds, the need for improved high- throughput screening assays to 
identify potential pharmaceutical compounds affecting nuclear 
receptors is clear. Historically, therapeutically useful nuclear receptor 
ligand compounds were identified by screening animal models, an 
35 approach which is even more labor intensive and time consuming than 
the methods used by Kamei. Also, approaches such as those used by 
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Kamei are ill-suited for the identification of antagonists of nuclear 
receptors. It is now widely appreciated that antagonists of nuclear 
receptors can be valuable therapeutic agents. Examples of such 
therapeutically useful antagonists are tamoxifene, raloxifene, and RU- 
5 486. 

What is needed is a high throughput, time and labor- 
saving, non-radioactive, inexpensive, and very reliable assay for the 
identification and characterization of both agonists and antagonists of 
nuclear receptors. Such an assay is provided by the present invention. 

10 

SUMMARY OF THE INVENTION 

The present invention provides novel methods of identifying 
agonists and antagonists of nuclear receptors. The methods take 
advantage of the agonist-dependent binding of nuclear receptors and 

15 GBP, p300, or other nuclear receptor co-activators. In the absence of 
agonist, binding between the nuclear receptor and CBP, p300, or other 
nuclear receptor co-activators does not occur. If agonist is present, 
however, such binding occurs and can be detected by fluorescence 
resonance energy transfer (FRET) between a fluorescently-labeled 

20 nuclear receptor and fluorescently-labeled CBP, p300, or other nuclear 
receptor co-activator. Antagonists can be identified by virtue of their 
ability to prevent or disrupt the agonist-induced interaction of nuclear 
receptors and CBP, p300, or other nuclear receptor co-activators. In 
contrast to prior art methods of identifying agonists and antagonists of 

25 nuclear receptors, the methods of the present invention, are simple, 
rapid, and less costly. 

The present invention provides a nuclear receptor or ligand 
binding domain thereof labeled with a fluorescent reagent for use in the 
above-described methods of identifying agonists and antagonists of 

30 nuclear receptors. The present invention also provides CBP, p300, or 
other nuclear receptor co-activator, or a binding portion thereof, labeled 
with a fluorescent reagent. 



-6- 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates a method of fluorescently labelling a 
protein or polypeptide with Europium cryptate (Eu3+K). 
5 Figure 2 illustrates the format for experiments 1 and 2 of 

Table 1. 

Figure 3 illustrates the format for experiment 3 of 

Table 1. 

Figure 4 illustrates the format for experiment 4 of 

10 Table 1. 

Figure 5 shows the results of studies using the methods of 
the present invention with four known PPARy agonists, -o- = AD5075; 
= Pioglitazone; -X- = Troglitazone; --0- = BRL49653. 

Figure 6 shows a measurement of the binding constant for 
15 the interaction between hCBP and PPARylLBD. 

Figure 7A shows the amino acid sequence of human CBP 
(SEQ.ID.NO.:l). 

Figure 7B shows the nucleotide sequence of a cDNA 
encoding human CBP (SEQ.ID.NO.:2). The open reading frame is at 
20 positions 76-1290. 

Figure 8A shows the amino acid sequence of human 
PPARa (SEQ.ID.NO.:3). 

Figure 8B shows the nucleotide sequence of a cDNA 
encoding human PPARa (SEQ.ID.NO.:4). The open reading frame is at 
25 positions 217-1623. 

Figure 9A shows the amino acid sequence of human 
PPARyl (SEQ.ID.NO.:5). 

Figure 9B shows the nucleotide sequence of a cDNA 
encoding human PPARyl (SEQ JD.NO.:6). The open reading frame is at 
30 positions 173-1609. 

Figure 10A shows the amino acid sequence of human 
PPAR8 (SEQ.ID.NO.:7). 

Figure 10B-C shows the nucleotide sequence of a cDNA 
encoding human PPAR5 (SEQ.ID.NO.:8). The open reading frame is at 
35 positions 338-1663. 



-7- 
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DETAILED DESCRIPTION OF THE INVENTION 
For the purposes of this invention: 

- an "agonist" is a substance that binds to nuclear receptors 
in such a way that a specific binding interaction between the nuclear 
receptor and CBP or other nuclear receptor co-activator can occur. 

- an "antagonist" is a substance that is capable of preventing 
or disrupting the agonist-induced specific binding interaction between a 
nuclear receptor and CBP, p300, or another nuclear receptor co- 
activator. 

- a "ligand" of a nuclear receptor is an agonist or an 
antagonist of the nuclear receptor. 

- a "specific binding interaction," "specific binding," and 
the like, refers to binding between a nuclear receptor and CBP, p300, or 
other nuclear receptor co-activator which results in the occurrence of 
fluorescence resonance energy transfer between a fluorescent reagent 
bound to the nuclear receptor and a fluorescent reagent bound to CBP, 
p300, or other nuclear receptor co-activator. 

With respect to CBP, p300, or other nuclear receptor co- 
activators, a "binding portion" is that portion of CBP, p300, or other 
nuclear receptor co-activators that is sufficient for specific binding 
interactions with nuclear receptors. 

With respect to nuclear receptors, a "ligand binding 
domain" is that portion of a nuclear receptor that is sufficient to bind an 
agonist or antagonist of the nuclear receptor. 

The present invention provides a high throughput, time 
and labor-saving, non-radioactive, inexpensive, and very reliable assay 
for the identification and characterization of both agonists and 
antagonists of nuclear receptors. In a general embodiment, the present 
invention provides methods of identifying agonists and antagonists for 
any nuclear receptor for which CBP, p300, or another nuclear receptor 
binding protein is a co-activator. Such agonists and antagonists are 
identified by virtue of their ability to induce or prevent binding between 
the ligand binding domain of a nuclear receptor and CBP, p300, or other 
nuclear receptor co-activator. The interaction between the nuclear 
receptor and CBP, p300, or other nuclear receptor co-activator is 
monitored by observing the occurrence of fluorescence resonance energy 

-8- 
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transfer (FRET) between two fluorescent reagents. One fluorescent 
reagent is bound to the nuclear receptor; the other fluorescent reagent is 
bound to CBP, p300, or other nuclear receptor co-activator. The binding 
of fluorescent reagent to nuclear receptor, CBP, p300, or other nuclear 
receptor co-activator can be by a covalent linkage or a non-covalent 
linkage. 

The present invention makes use of fluorescence resonance 
energy transfer (FRET). FRET is a process in which energy is 
transferred from an excited donor fluorescent reagent to an acceptor 
fluorescent reagent by means of intermolecular long-range dipole-dipole 
coupling. FRET typically occurs over distances of about lOfc to 1006 and 
requires that the emission spectrum of the donor reagent and the 
absorbance spectrum of the acceptor reagent overlap adequately and that 
the quantum yield of the donor and the absorption coefficient of the 
acceptor be sufficiently high. In addition, the transition dipoles of the 
donor and acceptor fluorescent reagents must be properly oriented 
relative to one another. For a review of FRET and its applications to 
biological systems, see Clegg, 1995, Current Opinions in Biotechnology 
6:103-110. 

The present invention makes use of a nuclear receptor or 
ligand binding domain thereof labeled with a first fluorescent reagent 
and CBP, p300, or other nuclear receptor co-activator, or a binding 
portion thereof, labeled with a second fluorescent reagent. The second 
fluorescent reagent comprises a fluorophore capable of undergoing 
energy transfer by either (a) donating excited state energy to the first 
fluorescent reagent, or (b) accepting excited state energy from the first 
fluorescent reagent. In other words, according to the present invention, 
either the first or the second fluorescent reagents can be the donor or the 
acceptor during FRET. 

The first and second fluorescent reagents are 
spectropscopically complementary to each other. This means that their 
spectral characteristics are such that excited state energy transfer can 
occur between them. FRET is highly sensitive to the distance between 
the first and second fluorescent reagents. For example, FRET varies 
inversely with the sixth power of the distance between the first and 
second fluorescent reagents. In the absence of agonist, the first 

-9- 
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fluorescent reagent, bound to the nuclear receptor or ligand binding 
domain thereof, will not be near the second fluorescent reagent, bound to 
CBP, p300, or other nuclear receptor co-activator, or binding portion 
thereof. Thus, no FRET, or very little FRET, will be observed. In the 

5 presence of agonist, however, interaction between the nuclear receptor 
and CBP, p300, or other nuclear receptor co-activator will occur, thus 
bringing close together the first and the second fluorescent reagents, 
allowing FRET to occur and be observed. 

Accordingly, the present invention provides a method of 

10 identifying an agonist of a nuclear receptor that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 

(b) CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a second fluorescent reagent; and 

15 (c) a substance suspected of being an agonist of the 

nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and CBP, p300, or other nuclear receptor co- 

20 activator, or a binding portion thereof, will occur; and 

(d) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 

25 In particular embodiments, the nuclear receptor is selected 

from the group consisting of steroid receptors, thyroid hormone 
receptors, retinoic acid receptors, peroxisome proliferator-activated 
receptors, retinoid X receptors, glucocorticoid receptors, vitamin D 
receptors, and "orphan nuclear receptors" such as LXR, FXR, etc. 

30 In a particular embodiment, the nuclear receptor or ligand 

binding domain thereof is a full-length nuclear receptor. In another 
embodiment, the nuclear receptor or ligand binding domain thereof is a 
ligand binding domain of a nuclear receptor. In another embodiment, 
the nuclear receptor or ligand binding domain thereof comprises an AF- 

35 2 site of a nuclear receptor. 
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In a particular embodiment, the nuclear receptor or ligand 
binding domain thereof is a full-length PPAR. In another embodiment, 
the nuclear receptor or ligand binding domain thereof is the ligand 
binding domain of a PPAR. In a further embodiment, the PPAR is 
5 selected from the group consisting of PPARa, PPARyl, PPARy2, and 
PPAR5. In a further embodiment, the ligand binding domain of the 
PPAR contains amino acid residues 176-478 of human PPARyl. 

In a particular embodiment, the nuclear receptor or ligand 
binding domain thereof contains amino acids 143-462 of human RARa. 

10 In another embodiment, the nuclear receptor or ligand binding domain 
thereof contains amino acids 122-410 of rat T3R0CI. In another 

embodiment, the nuclear receptor or ligand binding domain thereof 
contains amino acids 227-463 of mouse RXRy. In another embodiment, 
the nuclear receptor or ligand binding domain thereof contains amino 

15 acids 251-595 of human ER. 

In a particluar embodiment, the above-described methods 
utilize full-length CBP, either mouse or human. In other embodiments, 
the methods utilize amino acid residues 1-113 of human CBP. In 
another embodiment, the methods utilize amino acid residues 1-453 of 

20 human CBP. 

The conditions under which the methods described above 
are carried out are conditions that are typically used in the art for the 
study of protein-protein interactions: e.g., physiological pH; salt 
conditions such as those represented by such commonly used buffers as 

25 PBS; a temperature of about 4°C to about 55°C. The presence of 

commonly used non-ionic detergents, e,g. y NP-40®, sarcosyl, Triton X- 
100®, is optional. When europium cryptates are used as fluorescent 
reagents, reactions should contain KF at a concentration of at least 200 
mM. 

30 Heery et aL, 1997, Nature 387:733-736 showed that 

interactions between nuclear receptors and a variety of nuclear receptor 
co-activators are mediated by a short amino acid sequence in the nuclear 
receptor co-activators having the amino acid sequence LXXLL, where L 
is leucine and X represents any amino acid. Accordingly, the present 

35 invention can be practiced with a binding portion of a nuclear receptor 
co-activator, provided that the binding portion contains the amino acid 

- 11- 
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sequence LXXLL. Therefore, the present invention includes a method of 
identifying an agonist of a nuclear receptor that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 
5 (b) a binding portion of a nuclear receptor co-activator, 

where the binding portion contains the amino acid sequence LXXLL, 
and where the binding portion is labeled with a second fluorescent 
reagent; and 

(c) a substance suspected of being an agonist of the 
10 nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and the binding portion of the nuclear receptor 
co-activator will take place; and 
15 (d) measuring fluorescence resonance energy transfer 

(FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 

In a particular embodiment, the nuclear receptor co- 
20 activator is selected from the group consisting of: human RIP- 140, 

human SRC-1, mouse TIF-2, human or mouse CBP, human or mouse 
p300, mouse TIF-1, and human TRIP proteins. 

In a particular embodiment, the nuclear receptor co- 
activator is human RIP- 140 and the binding portion includes a 
25 contiguous stretch of amino acids of human RIP- 140 selected from the 
group consisting of: positions 20-29, 132-139, 184-192, 266-273, 379-387, 
496-506, 712-719, 818-825, 935-944, and 935-942. 

In another embodiment, the nuclear receptor co-activator is 
human SRC-1 and the binding portion includes a contiguous stretch of 
30 amino acids of human SRC-1 selected from the group consisting of: 
positions 45-53, 632-640, 689-696, 748-755, and 1434-1441. 

In another embodiment, the nuclear receptor co-activator is 
mouse TIF-2 and the binding portion includes a contiguous stretch of 
amino acids of mouse TIF-2 selected from the group consisting of: 
3 5 positions 640-650, 689-699, and 744-754. 



- 12- 



WO 99/18124 



PCT/US98/21049 



In another embodiment, the nuclear receptor co-activator is 
human or mouse CBP and the binding portion includes a contiguous 
stretch of amino acids of human or mouse CBP selected from the group 
consisting of: positions 68-78 and 356-366. 
5 In another embodiment, the nuclear receptor co-activator is 

human or mouse p300 and the binding portion includes a contiguous 
stretch of amino acids of human or mouse p300 selected from the group 
consisting of: positions 80-90 and 341-351. 

In another embodiment, the nuclear receptor co-activator is 
10 mouse TIF-1 and the binding portion includes a contiguous stretch of 
amino acids of mouse TIF-1 containing positions 722-732. 

In another embodiment, the nuclear receptor co-activator is 
human TRIP2 and the binding portion includes a contiguous stretch of 
amino acids of human TRIP2 containing positions 23-33. 
15 In another embodiment, the nuclear receptor co-activator is 

human TRIP3 and the binding portion includes a contiguous stretch of 
amino acids of human TRIP3 containing positions 97-107. 

In another embodiment, the nuclear receptor co-activator is 
human TRIP4 and the binding portion includes a contiguous stretch of 
20 amino acids of human TRIP4 containing positions 36-46. 

In another embodiment, the nuclear receptor co-activator is 
human TRIPS and the binding portion includes a contiguous stretch of 
amino acids of human TRIP5 containing positions 26-36. 

In another embodiment, the nuclear receptor co-activator is 
25 human TRIP8 and the binding portion includes a contiguous stretch of 
amino acids of human TRIP8 containing positions 36-46. 

In another embodiment, the nuclear receptor co-activator is 
human TRIP9 and the binding portion includes a contiguous stretch of 
amino acids of human TRIP9 selected from the group consisting of: 
30 positions 73-83, 256-266 and 288-298. 

For amino acid sequences of nuclear receptor co-activators, 
see Yao et al., 1996, Proc. Natl Acad Sci. USA 93:10626-10631 (SRC-1); 
0§ate et al., 1995, Science 270:1354-1357 (SRC-1); Cavaillfes et al., 1995, 
EMBO J. 14:3741-3751 (RIP- 140); Voegel et al., 1996, EMBO J. 15:101-108 
35 (TIF-2); Kwok et al., 1994, Nature 370:223-226 (CBP); Arias et al., 1994, 
Nature 370:226-229 (CBP); Eckner et al., 1994, Genes Dev. 8:869-884 
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(p300); Le Douarin et al, 1995, EMBO J. 14:2020-2033 (TIF-1); Lee et al., 
1995, Nature 374:91-94 (TRIP proteins). 

The particular embodiments of the present invention 
described above are all particular embodiments of a more general 
method that is also part of the present invention. That general method 
is a method of identifying an agonist of a nuclear receptor that 
comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 

(b) a polypeptide containing the amino acid sequence 
LXXLL where the polypeptide is labeled with a second fluorescent 
reagent; and 

(c) a substance suspected of being an agonist of the 
nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and the polypeptide will take place; and 

(d) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 

In a particular embodiment, the amino acid sequence 
LXXLL is present in an a helical portion of the polypeptide. In another 
embodiment, the amino acid sequence LXXLL is present in an a helical 
portion of the polypeptide and the leucines form a hydrophobic face. 

The present invention provides methods for identifying 
antagonists of a nuclear receptor. Such methods are based on the ability 
of the antagonist to prevent the occurrence of agonist-induced binding 
between a nuclear receptor and CBP, p300, or other nuclear receptor co- 
activator, or to disrupt such binding after it has occurred. Thus, the 
present invention provides a method for identifying antagonists of 
nuclear receptors that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 

(b) CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a second fluorescent reagent; 
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(c) an agonist of the nuclear receptor; and 

(d) a substance suspected of being an antagonist of the 
nuclear receptor; 

under conditions such that, in the absence of the substance, 
5 binding between the nuclear receptor or ligand binding domain thereof 
and CBP, p300, or other nuclear receptor co-activator, or a binding 
portion thereof will occur; and 

(e) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents when the 

10 substance is present and measuring FRET between the first and second 
fluorescent reagents when the substance is absent; 

where the a decrease in FRET when the substance is 
present indicates that the substance is an antagonist of the nuclear 
receptor. 

15 In particular embodiments, the nuclear receptor is selected 

from the group consisting of steroid receptors, thyroid hormone 
receptors, retinoic add receptors, peroxisome proliferator-activated 
receptors, retinoid X receptors, glucocorticoid receptors, vitamin D 
receptors, and "orphan nuclear receptors" such as LXR, FXR, etc. 

20 In a particular embodiment, the nuclear receptor or ligand 

binding domain thereof is a full-length nuclear receptor. In another 
embodiment, the nuclear receptor or ligand binding domain thereof is a 
ligand binding domain of a nuclear receptor. In another embodiment, 
the nuclear receptor or ligand binding domain thereof is an AF-2 site of 

25 a nuclear receptor. 

In a particular embodiment, the nuclear receptor or ligand 
binding domain thereof is a full-length PPAR. In another embodiment, 
the nuclear receptor or ligand binding domain thereof is the ligand 
binding domain of a PPAR. In a further embodiment, the PPAR is 

30 selected from the group consisting of PPARa, PPARy, and PPAR5. In a 
further embodiment, the ligand binding domain of the PPAR contains 
amino acid residues 176-478 of human PPARyl. 

In a particular embodiment, the nuclear receptor or ligand 
binding domain thereof contains amino acids 143-462 of human RARcc. 

35 In another embodiment, the nuclear receptor or ligand binding domain 
thereof contains amino acids 122-410 of rat T 3 R<xl. In another 
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embodiment, the nuclear receptor or ligand binding domain thereof 
contains amino acids 227-463 of mouse RXRy. In another embodiment, 
the nuclear receptor or ligand binding domain thereof contains amino 
acids 251-595 of human ER. 
5 In a particular embodiment, the above-described methods 

utilize full-length CBP, either mouse or human. In other embodiments, 
the methods utilize amino acid residues 1-113 of human CBP. In 
another embodiment, the methods utilize amino acid residues 1-453 of 
human CBP. 

10 The conditions under which the methods described above 

are carried out are conditions that are typically used in the art for the 
study of protein-protein interactions: e.g., physiological pH; salt 
conditions such as those represented by such commonly used buffers as 
PBS; a temperature of about 4°C to about 55°C. The presence of 

15 commonly used non-ionic detergents, e.g., NP-40®, sarcosyl, Triton X- 
100®, is optional. When europium cryptates are used as fluorescent 
reagents, reactions should contain KF at a concentration of at least 200 
mM. 

In principle, one could measure FRET by monitoring either 

20 (a) a decrease in the emission of the donor fluorescent reagent following 
stimulation at the donor's absorption wavelength and/or (b) an increase 
in the emission of the acceptor reagent following stimulation at the 
donor's absorption wavelength. In practice, FRET is most effectively 
measured by emission ratioing. Emission ratioing monitors the change 

25 in the ratio of emission by the acceptor over emission by the donor. An 
increase in this ratio signifies that energy is being transferred from 
donor to acceptor and thus that FRET is occurring. Emission ratioing 
can be measured by employing a laser-scanning confocal microscope. 
Emission ratioing is preferably done by splitting the emitted light from a 

30 sample with a dichroic mirror and measuring two wavelength bands 
(corresponding to the donor and the acceptor emission wavelengths) 
simultaneously with two detectors. Alternatively, the emitted light can 
be sampled consecutively at each wavelength (by using appropriate 
filters) with a single detector. In any case, these and other methods of 

35 measuring FRET are well known in the art. 
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Although a variety of donor and acceptor fluorescent 
reagents can be used in the practice of the present invention, preferred 
embodiments of the present invention make use of cryptates of 
fluorescent reagents as donor reagents. Inclusion of a substrate into the 

5 intramolecular cavity of a macropolycyclic ligand results in the 
formation of a cryptate. The macropolycyclic ligand shields the 
substrate from interaction with solvent and other solute molecules. If 
the substrate is a fluororescent reagent, formation of a cryptate may 
result in markedly different spectroscopic characteristics for the reagent 

10 as compared to the spectroscopic characteristics of the free reagent. 

The present invention includes the use of europium (EuIII) 
or terbium (TbUI) cryptates as donor fluorescent reagents. Such EuIII 0 r 
Tblll cryptates, as well as methods for their formation, are well known 
in the art. For example, see Alpha et al., 1987, Angew. Chem. Int. Ed. 

15 Engl. 26:266-267; Mathis, 1995, Clin. Chem. 41:1391-1397. A europium 
cryptate is formed by the inclusion of a europium ion into the 
intramolecular cavity of a macropolycyclic ligand which contains 
bipyridine groups as light absorbers. When europium cryptates are 
present in solution together with fluoride ions, a total shielding of the 

20 europium cryptate fluorescence is occurs. The molecular structure of a 
europium cryptate is shown below. 
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Europium cryptates can be conjugated to proteins by the use 
of well-known heterobifunctional reagents (see, e.g. , International 
Patent Application WO 89/05813; Prat et al., 1991, Anal. Biochem. 
195:283-289; Lopez et al., 1993, Clin. Chem. 39:196-201). 

5 The present invention includes the use of XL665 as the 

acceptor fluorescent reagent. XL665 is a crosslinked derivative of 
allophycocyanin (APC). APC is a porphyrin containing protein which is 
derived from the light harvesting system of algae (Kronick, 1986, M. 
Immunol. Meth. 92:1-13). XL665 has an absorption maximum at =620 

10 nm and an emission maximum at 665 run. In some embodiments of the 
invention, XL665 is labeled with streptavidin in order to effect the 
binding of the streptavidin-labled XL665 to a biotin-labeled substance, 
e.g. , CBP or the ligand binding domain of a nuclear receptor. 
Streptavidin labeling of XL655 and biotin labeling of CBP, or the ligand 

15 binding domain of a nuclear receptor, can be performed by well known 
methods. 

In a preferred embodiment of the invention, XL665 as the 
acceptor fluorescent reagent is combined with Europium cryptate 
(Eu3+K) as the donor fluorescent reagent. Europium cryptate (Eu3+K) 

20 has a large Stokes shift, absorbing light at 337 nm and emitting at 620 
nm. Thus, the emission maximum of Europium cryptate (Eu3+K) 
overlaps the absorption maximum of XL665. Europium cryptate 
(Eu3+K) has a large temporal shift; the time between absorption and 
emission of a photon is about 1 millisecond. This is advantageous 

25 because most background fluorescence signals in biological samples are 
short-lived. Thus the use of a fluorescent reagent such as europium 
cryptate, with a long fluorescent lifetime, permits time-resolved 
detection resulting in the reduction of background interference. 

The spectral and temporal properties of europium cryptate 

30 (Eu3+K) result in essentially no fluorescence background and thus 

assays using this fluorescent reagent can be carried out in a "mix and 
read" mode, greatly facilitating its use as a high throughput screening 
tool. For the embodiment using Europium cryptate (Eu3+K) and XL665, 
the measuring instrument irradiates the sample at 337 run and 

35 measures the fluorescence output at two wavelengths, 620 nm (B counts, 
europium fluorescence) and 665 nm (A counts, XL665 fluorescence). 
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The extent of flurorescent resonance energy transfer is measured as the 
ratio between these two values. Typically this ratio is multiplied by 
10,000 to give whole numbers. 

Other FRET donor-acceptor pairs are suitable for the 

5 practice of the present invention. For example, the following donor- 
acceptor pairs can be used: dansyl/fluorescein; fluorescein/rhodamine; 
tryptophan/aminocoumarin. 

The present invention provides a nuclear receptor or ligand 
binding domain thereof labeled with a fluorescent reagent for use in the 

10 above-described methods of identifying agonists and antagonists of 
nuclear receptors. The present invention also provides CBP, p300, or 
other nuclear receptor co-activator, or a binding portion thereof, labeled 
with a fluorescent reagent. 

In a particular embodiment, the nuclear receptor or ligand 

15 binding domain thereof is selected from the group consisting of PPARot, 
PPARy, PPAR8, a ligand binding domain of PPARot, PPARy, or PPAR5, 
and amino acid residues 176-478 of human PPARyl and the fluorescent 
reagent is selected from the group consisting of XL665 and Europium 
cryptate (Eu3+K). 

20 In a particular embodiment, CBP, p300, or other nuclear 

receptor co-activator is labeled with a fluorescent reagent selected from 
the group consisting of XL665 and Europium cryptate (Eu3+K). 

The following non-limiting examples are presented to better 
25 illustrate the invention. 

EXAMPLE 1 

Cloning, expressio n, and purification of human CBP and PPAR 
proteins 

30 To test whether human CBP can interact with PPARs in an 

agonist-dependent manner, we cloned the human cDNA fragments 
encoding the NH2-terminal 1-113 amino acids (hCBPl-113) and 1-453 

amino acids (hCBPl-453) of human CBP by the polymerase chain 
reaction (PCR). The DNA and amino acid sequences of human CBP are 
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disclosed in Borrow et aL, 1996, Nature Genet. 14:33-41 and in GenBank, 
accession no. U47741. 

The primers used for hCBPl-113 were: 
5'-ACTCGGATCCAAGCCATGGCTGAGAACTTGCTGGACGG-3 , 
5 (SEQ.ID.NO.:9) and 

S'-CACAAAGCTTAGGCCATGTTAGCACTGTTCGG-S' (SEQ.ID.NO.: 
10). 

These primers were expected to amplify a 0.9 kb DNA fragment. 
The primers for hCBPl-453 were: 
10 5 , -ACTCGGATCCAAGCCATGGCTGAGAACTTGCTGGACGG-3 , 
(SEQ.ID.NO.:9) and 

5 , CTCAGTCGACTTATTGAATTCCACTAGCTGGAGATCC-3 , 
(SEQ.ID.NO.rll). 

These primers were expected to amplify a 1.5 kb DNA fragment.. 

15 The template for the PCR reaction was a human fetal brain 

cDNA library (Stratagene, Catalogue #IS 937227). Of course, any 
human cDNA library from a tissue expressing CBP could have been 
used. The PCR amplified 0.9 kb and 1.5 kp DNA fragments which were 
digested with restriction endonucleases and ligated into pBluescript II 

20 vector. DNA sequencing analysis confirmed that the amplified 

fragments were identical to the corresponding published nucleic acid 
sequences of human CBP. 

Based on the publicly available sequences for human CBP 
cited above, other primers could be readily identified and prepared by 

25 those skilled in the art in order to amplify and clone other portions of 
cDNA encoding human CBP from appropriate cDNA libraries. Once 
such portions of human CBP are produced, they could be used in the 
methods of the present invention in a manner similar to that described 
herein for hCBPl-113 and hCBPl-453. The amino acid sequence of 

30 human CBP is shown in Figure 7 A; the nucleic acid sequence of the 
cDNA encoding human CBP is shown in Figure 7B. 

To express the polypeptides encoded by the PCR fragments, 
vectors encoding fusion proteins of the polypeptides and glutathione S- 
transferase (GST) were constructed and expressed in E. coli. The PCR 

35 fragments were subcloned into the expression vector pGEX (Pharmacia 
Biotech) to generate pGEXhCBPl-113 and pGEXhCBPl-453. 
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pGEXhCBPl-113 and pGEXhCBPl-453 were transfected into the DH5a 
strain of E. coli (GIBCO BRL) and the bacteria hosting either 
pGEXhCBPl-113 or pGEXhCBPl-453 were cultured in LB medium 
(GIBCO BRL) to a density of OD600 = 0.7-1.0 and induced for 
5 overexpression of the GST-CBP fusion proteins by addition of IPTG 
(isopropylthio-(3-galactoside) to a final concentration of 0.2 mM. The 
IPTG induced cultures were further grown at room temperature for 2-5 
hrs. The cells were harvested by centrifugation for 10 min at 5000g. The 
cell pellet was used for GST-CBP fusion protein purification by following 

10 the procedure from Pharmacia Biotech using Glutathione Sepharose 
beads. hCBPl-113 and hCBP 1-453 proteins were generated by cleaving 
the corresponding GST fusion proteins with thrombin. SDS- 
polyacrylamide gel electrophoresis analysis showed that the preparation 
from pGEXhCBPl-113 gave two polypeptide bands, with apparent 

15 molecular weight of 12 kd and 10 kd. The 12 kd band is the expected size 
of hCBPl-113 and the 10 kd band is most likely a premature translational 
termination product. The preparation from pGEXhCBPl-450 gave a 
single band with the expected size, 50 kd. 

cDNAs encoding full-length PPARa and PPARyl were 

20 subcloned into pGEX vectors for the production of GST-PPARa and GST- 
PPARyl fusion proteins in E.coli. PPARyl was cloned from a human fat 
cell cDNA library (see Elbrecht et al., 1996, Biochem. Biophys. Res. 
Comm. 224:431-437). A cDNA encoding the human PPARyl ligand 
binding domain (PPARylLBD; amino acids 176-478 of PPARyl) was 

25 subcloned from a modified pSG5 vector as a Xho I (site located in the N- 
terminus of the LBD)/ Xba I (site located in the pSG5 vector) fragment. 
The Xba I site was blunt-ended with T4 DNA polymerase. The 1.1 kb 
fragment containing the LBD was purified from an agarose gel and 
ligated into pGEX-KG (see Guan & Dixon, 1991, Anal. Biochem. 192:262- 

30 267) that had been digested with Xho I and Hind III (the Hind III site 
had been blunt-ended with T4 DNA polymerase). This construct was 
used for the production of GST-hPPARylLBD and hPPARylLBD (the 
ligand binding domain cleaved free of GST). The overexpression and 
purification of PPARa, PPARyl, and PPARylLBD were as described 

35 above for CBP. 
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The DNA and amino acid sequences of human PPARa are 
disclosed in Schmidt et al., 1992, Mol. Endocrinol. 6:1634-1641 and in 
GenBank, accession no. L07592. See Figure 8A and 8B. 

The DNA and amino acid sequences of human PPARyl are 
5 disclosed in Greene et al., 1995, Gene Expr. 4:281-299; Qi et al., 1995, Mol. 
Cell. Biol. 15:1817-1825; Elbrecht et al., 1996, Biochem. Biophys. Res. 
Comm. 224:431-437; and in GenBank, accession no. L40904. See Figure 
9A and 9B. Human PPARy2 contains the same amino acid sequence as 
human PPARyl except for an amino terminal addition of 24 amino acids 

10 (see Elbrecht et al., 1996, Biochem. Biophys. Res. Comm. 224:431-437). 
Thus, the amino acid sequence of the ligand binding domain of human 
PPARy2 is the same as the amino acid sequence of the ligand binding 
domain of human PPARyl, although the numbering of the amino acids 
differs (176-478 for human PPARyl and 200-502 for human PPARy2). 

15 The DNA and amino acid sequences of human PPAR5 are 

disclosed in Sher et al., 1993, Biochemistry 32:5598-5604 and in GenBank, 
accession no. L02932. See Figure 10A-C. 

EXAMPLE 2 

20 Interaction between PPARs and hCBP fragments 

Experiments were first conducted using hCBPl-113 and 
hPPARylLBD. Purified hPPARylLBD was biotinylated with Sulfo-NHS- 
LC-Biotin (PIERCE) to a biotmihPPARylLBD ratio of 3:1 according to the 
procedure provided by PIERCE. Purified hCBPl-113 was directly labeled 

25 with europium cryptate (Eu3+K) by the method illustrated in Figure 1. 
Biotin-labeled hPPARylLBD, Eu3+K-labeled hCBPl-113, and 
streptavidin-labeled XL665 (SA-XL665; from PACKARD) were incubated 
together in the presence or absence of 1 |^M of known PPARy agonist 
(BRL49653 orAD5075). 

30 Thus, this experimental format made use of the fluorescent 

reagent pair europium cryptate (Eu3+K), which acted as donor, and 

XL665, which acted as acceptor. hCBPl-113 was directly labeled with 

europium cryptate (Eu3+K); hPPARylLBD was indirectly labeled with 

XL665 by means of a biotin-streptavidin link. The emission maximum 
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of europium cryptate (Eu3+K) overlaps with the absorption maximum of 
XL665. Therefore, when europium cryptate (Eu3+K) and XL665 are in 
close proximity, and the sample is illuminated with light at 337 nm (the 
absorption maximum of europium cryptate (Eu3+K)), FRET can occur 
5 between europium cryptate (Eu3+K) and XL 665. This FRET manifests 
itself as increased emission at 665 nm by XL665. Figure 2 shows a 
schematic of the format used in this experiment (experiment 1 of Table 
1). When agonist is bound to hPPARylLBD, a specific interaction occurs 
between hPPARylLBD and hCBPl-113, thus bringing europium cryptate 

10 (Eu3+K) and XL665 into close enough proximity for FRET to occur. In 
the absence of agonist, no interaction occurs between hPPARylLBD and 
hCBPl-113 and thus europium cryptate (Eu3+K) and XL665 are not 
brought into close proximity and no FRET occurs. When FRET occurs, 
the amount of light given off by the sample at the emission maximum of 

15 XL665 (665 nm) is increased relative to the amount of light given off by 
the sample at the emission maximum of europium cryptate (Eu3+K) 
(620 nm). Therefore, measuring the ratio of emission at 665 nm to 620 
nm in the presence and the absence of a substance suspected of being an 
agonist allows for the determination of whether that substance actually 

20 is an agonist. If the substance is an agonist, an increase in the ratio of 
emission at 665 nm to 620 nm in the presence of the substance will be 
observed. 

Reactions were carried out in microtiter plates. Reaction 
conditions were: appropriate volume (total 250 jj!) of the reaction buffer 

25 (either PBS or HEPES, see below, containing 500 mM KF, 0.1% bovine 
serum albumin, BSA) was added to each well, followed by addition of 
ligands (BRL49653 or AD5075 at a final concentration of 1 ^iM and 0.1% 
dimethylsulfoxide (DMSO) or vehicle control (0.1% DMSO), Eu3+K 
labeled hCBP (100 nM), biotin-hPPARylLBD (100 nM), and streptavidin- 

30 labeled XL665 (100 nM) to appropriate wells. After mixing, 200 |il of 
reaction mixture was transferred to a new well. The plate was either 
directly measured for fluorescence resonance energy transfer (FRET) or 
covered with sealing tape (PACKARD) to avoid evaporation and 
incubated at room temperature for up to 24 hrs before measuring FRET. 

35 The results of this experiment and others described below 

yielded ratio values as follows: 
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Table 1 



Experiment 


Buffer 


Emission ratio 
with AD5075 


Emission ratio 
with vehicle 


1 


PBS 


1134 


1074 


2 


HEPES + 0.05% 
NP40 


967 


617 


3 


HEPES + 0.05% 
NP40 


1078 


536 


4 


HEPES + 0.05% 
CHAPS 


1883 


487 
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Experiment 1 of Table 1 was carried out using PBS (137 mM 
NaCl, 2.7 mM KC1, 4.3 mM Na2HP04, 1.4 mM KH2PO4, pH 7.4). The 
greater emission ratio observed in the presence of AD5075 demonstrated 
that a specific interaction between hCBPl-113 and hPPARylLBD 
5 occurred in the presence of the agonist AD5075. Although it was clear 
that FRET was occurring, the signal-noise ratio was small. In 
experiment 2 of Table 1, HEPES buffer (N-2-hydroxyethylpiperazine-N'- 
2-ethane sulfonic acid, 100 mM, pH 7.0) containing 0.05% NP40 (Nonidet 
P-40) was used instead of PBS and an improved signal-noise ratio was 
10 obtained. 

In order to get an even better signal-noise ratio, the above- 
described format was modified slightly for experiment 3. In experiment 
3, SA-XL665 (500 nM), biotin-labeled hPPARYlLBD (100 nM), GST- 
hCBPl-113, and Eu3+K labeled anti-GST antibody (2.5 ul) were incubated 
15 in the presence or absence of AD5075 (1 uM) in HEPES buffer containing 
0.05% NP40. A two-fold signal- noise ratio was obtained. Figure 3 shows 
a schematic of the format used in experiment 3. 

The anti-GST antibody was a goat antibody to GST from 
Pharmacia (catalogue number 27-4577-01) that was labeled with Eu3+K 
according to the procedure summarized below. 

- Make up @ 10 
mg/mL in H20. 
Need 42.2 ug (4.2 
uL, 96.6 nmol) for 
49.0 ug Eu3+ 
Reagent 



20 




NaO a S — C I '' \ / o 

Eu^O NH 2 

CU J 0 FW = 436.4 



-Resuspended @ -FW = 1465 2 9 Equiv SULFO-SMCC, 

2.5 mg/mL in 10% Use 49.0 ug 20 mM Pi buffer, 10% DMF 

DMF/PBS (19.6 uL, 33.4 nmol) RT, 30 minutes 
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Anti-GST Antibody, 
Cat #27-4577-01 



Anti-GST Antibody, 
Cat # 27-4577-01 



'NH, 



5 equiv. Eu3+ 
complex 




1 ) Add Eu reagent 

2) [Pr] 4.2 mg/mL. 
Onight at 4°C 

Lower pH: Add 12 pL of 
lMNaPi, pH 7.0. 
pH drops to 7. 18. 

350 pM TCEP 

(35 mM stock is 

10.0 mg/mL- PBS,pH 

7.0), 2.4 pL, 

15 min rt then 15 min @ 

4°C 



O' 



From Pharmacia, 5.0 mg/mL, 
FW = 150 kD Use 200 uL (1 mg, 
6.66 nmol) exchange into 10 mM 
Borate, 350 mM NaCl, 10% Gly, 
pH 8.5 with BioSpin-30 



5.0 Equiv SPDP, 
RT, 5 hours 



FW=312, Dissolve 
@1.00 mg/mL in EtOH. 
Add 10.4 uL (5 equiv., 
10.4 jig, 33.4 nmol) to 
protein. 



To farther improve the signal to noise ratio, a series of 
experiments were conducted. Experiment 4 of Table 1 exemplifies 
5 results obtained from those efforts. cDNA encoding a longer fragment of 
hCBP was cloned and expressed to get hCBPl-453. hCBPl-453 was 
biotinylated. Biotin-labeled hCBPl-453 (25 nM), SA-XL665 (100 nM), 
GST-hPPARylLBD (1 nM), and Eu3+K-labeled anti-GST antibody (2 nM) 
were mixed together in the presence or absence of 1 \iM AD5075. The 
10 detergent was changed from 0.05% NP40 to 0.5% CHAPS (3-{[3- 

cholamidopropyl]dimethyl-ammoniol}-l-propanesulfonate). A three- to 
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four-fold signal-noise ratio was obtained. Figure 4 shows the strategy 
used for experiment 4 and similar experiments. 

The correlation between results from the above-described 
assays and previously reported results from in vitro binding and 
5 transcriptional activation assays of selected antidiabetic insulin 

sensitizers that are known to be PPARy agonists (Elbrecht et al., 1996, 
Biochem Biophys Res Comm 224:431-437) was analyzed by titrating those 
known PPARy agonists in the assays described above and comparing 
EC50s so obtained with previously described values for potency in 
10 binding or transcriptional activation assays for the known agonists. The 
results are shown in Figure 5. From Figure 5, the following EC5OS can 
be derived: 

AD5075 = 8 nM 
BRL49653 = 53 nM 
15 Troglitazone = 646 nM 

Pioglitazone = 890 nM. 
These EC50s generated in the above-described assays are in close 
agreement with those generated by in vitro binding and transcriptional 
activation studies (Elbrecht et al., 1996, Biochem Biophys Res Comm 
20 224:431-437). 

The above-described assay can also be used to characterize 
the interaction between nuclear receptors with co-activators as, e.g., by 
determining the binding constant for that interaction. Figure 6 shows 
an example of such an application. Saturating amounts of PPARy 

25 agonist (10 nM BRL49653) were used. Increasing concentrations of non- 
biotinylated hCBPl-453 were used to titrate away biotin-hCBP- 
PPARylLBD complex and decrease the fluorescence energy transfer. A 
Kd of 300 nM for the interaction between hCBPl-453 and PPARylLBD 
can be derived from the results illustrated in Figure 6 and this Kd (300 

30 nM) is a measurement ofthe affinity between CBP and PPARy. 

The present invention is not to be limited in scope by the 
specific embodiments described herein. Indeed, various modifications 
of the invention in addition to those described herein will become 
apparent to those skilled in the art from the foregoing description. Such 

35 modifications are intended to fall within the scope ofthe appended 
claims. 
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Various publications are cited herein, the disclosures of 
which are incorporated by reference in their entireties. 
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WHAT IS CLAIMED: 

1. A method of identifying an agonist of a nuclear 
receptor that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
5 labeled with a first fluorescent reagent; 

(b) CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a second fluorescent reagent; and 

(c) a substance suspected of being an agonist of the 
nuclear receptor; 

10 under conditions such that, if the substance is an agonist of 

the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and CBP, p300, or other nuclear receptor co- 
activator, or a binding portion thereof, will occur; and 

(d) measuring fluorescence resonance energy transfer 
15 (FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 

2. The method of claim 1 where the nuclear receptor or 
20 ligand binding domain thereof is selected from the group consisting of 

steroid receptors, thyroid hormone receptors, retinoic acid receptors, 
peroxisome proliferator-activated receptors, retinoid X receptors, 
glucocorticoid receptors, vitamin D receptors, LXR, and FXR. 

25 3. The method of claim 1 where the nuclear receptor or 

ligand binding domain thereof is selected from the group consisting of a 
full-length nuclear receptor, a ligand binding domain of a nuclear 
receptor, and an AF-2 site of a nuclear receptor. 

30 4. The method of claim 1 where the nuclear receptor or 

ligand binding domain thereof comprises an AF-2 site of a nuclear 
receptor. 

5. The method of claim 1 where the nuclear receptor or 
35 ligand binding domain thereof is selected from the group consisting of a 
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full-length PPAR, a ligand binding domain of a PPAR, and amino acid 
residues 176-478 of human PPARyl. 

6. The method of claim 1 where the nuclear receptor or 
5 ligand binding domain thereof is selected from the group consisting of 

PPARa, PPARyl, PPARy2, and PPAR5. 

7. The method of claim 1 where the nuclear receptor or 
ligand binding domain thereof comprises a ligand binding domain 

10 selected from the group consisting of amino acids 143-462 of human 
RARa, amino acids 122-410 of rat T3Ral, amino acids 227-463 of mouse 
RXRy, and amino acids 251-595 of human ER. 

8. The method of claim 1 where CBP, p300, or other 
15 nuclear receptor co-activator, or a binding portion thereof is selected 

from the group consisting of full-length human CBP, full-length mouse 
CBP, amino acid residues 1-113 of human CBP, and amino acid residues 
1-453 of human CBP. 

20 9. The method of claim 1 where the first fluorescent 

reagent is selected from the group consisting of XL665 and Europium 
cryptate (Eu3+K). 

10. The method of claim 1 where the second fluorescent 
25 reagent is selected from the group consisting of XL665 and Europium 

cryptate (Eu3+K). 

11. A method of identifying an agonist of a nuclear 
receptor that comprises providing: 

30 (a) a nuclear receptor or ligand binding domain thereof 

labeled with a first fluorescent reagent; 

(b) a binding portion of a nuclear receptor co-activator, 
where the binding portion contains the amino acid sequence LXXLL, 
and where the binding portion is labeled with a second fluorescent 

35 reagent; and 
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(c) a substance suspected of being an agonist of the 
nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and the binding portion of the nuclear receptor 
co-activator will take place; and 

(d) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 

12. The method of claim 11 where the binding portion of a 
nuclear receptor co-activator is selected from the group consisting of 
human RIP-140, human SRC-1, mouse TIF-2, human or mouse CBP, 
human or mouse p300, mouse TIF-1, and human TRIP proteins. 

13. A method of identifying an agonist of a nuclear 
receptor that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 

(b) a polypeptide containing the amino acid sequence 
LXXLL where the polypeptide is labeled with a second fluorescent 
reagent; and 

(c) a substance suspected of being an agonist of the 
nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and the polypeptide will take place; and 

(d) measuring fluorescent resonance energy transfer 
(FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 

14. A method for identifying an antagonist of a nuclear 
receptor that comprises providing: 
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(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 

(b) CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a second fluorescent reagent; 

5 (c) an agonist of the nuclear receptor; and 

(d) a substance suspected of being an antagonist of the 
nuclear receptor; 

under conditions such that, in the absence of the substance, 
binding between the nuclear receptor or ligand binding domain thereof 
10 and CBP, p300, or other nuclear receptor co-activator, or a binding 
portion thereof will occur; and 

(e) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents when the 
substance is present and measuring FRET between the first and second 

15 fluorescent reagents when the substance is absent; 

where the a decrease in FRET when the substance is 
present indicates that the substance is an antagonist of the nuclear 
receptor. 

20 15. The method of claim 14 where the nuclear receptor or 

ligand binding domain thereof is selected from the group consisting of 
steroid receptors, thyroid hormone receptors, retinoic acid receptors, 
peroxisome proliferator-activated receptors, retinoid X receptors, 
glucocorticoid receptors, vitamin D receptors, LXR, and FXR. 

25 

16. The method of claim 14 where the nuclear receptor or 
ligand binding domain thereof is selected from the group consisting of a 
full-length nuclear receptor, a ligand binding domain of a nuclear 
receptor, and an AF-2 site of a nuclear receptor. 

30 

17. The method of claim 14 where the nuclear receptor or 
ligand binding domain thereof comprises an AF-2 site of a nuclear 
receptor. 

35 18. The method of claim 14 where the nuclear receptor or 

ligand binding domain thereof is selected from the group consisting of a 
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full-length PPAR, a ligand binding domain of a PPAR, and amino acid 
residues 176-478 of human PPARyl. 

19. The method of claim 14 where the nuclear receptor or 
5 ligand binding domain thereof is selected from the group consisting of 

PPARa, PPARyl, PPARy2, and PPAR8. 

20. The method of claim 14 where the nuclear receptor or 
ligand binding domain thereof comprises a ligand binding domain 

10 selected from the group consisting of amino acids 143-462 of human 
RARa, amino acids 122-410 of rat T3Ral, amino acids 227-463 of mouse 
RXRy, and amino acids 251-595 of human ER. 

21. The method of claim 14 where CBP, p300, or other 
15 nuclear receptor co-activator, or a binding portion thereof is selected 

from the group consisting of full-length CBP, amino acid residues 1-113 
of human CBP, and amino acid residues 1-453 of human CBP. 

22. The method of claim 14 where the first fluorescent 
20 reagent is selected from the group consisting of XL665 and Europium 

cryptate (Eu3+K). 

23. The method of claim 14 where the second fluorescent 
reagent is selected from the group consisting of XL665 and Europium 

25 cryptate (Eu3+K). 

24. A nuclear receptor or ligand binding domain thereof 
labeled with a fluorescent reagent. 

30 25. The nuclear receptor or ligand binding domain 

thereof of claim 24 where the nuclear receptor or ligand binding domain 
thereof is selected from the group consisting of PPARa, PPARyl, 
PPARy2, PPAR5, a ligand binding domain of PPARa, PPARyl, PPARy2, 
or PPAR8, and amino acid residues 176-478 of human PPARyl and the 

35 fluorescent reagent is selected from the group consisting of XL665 and 
Europium cryptate (Eu3+K). 
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26. CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a fluorescent reagent. 

27. The CBP, p300, or other nuclear receptor co-activator, 
or a binding portion thereof, of claim 26 where the fluorescent reagent is 
selected from the group consisting of XL665 and Europium cryptate 
(Eu3+K). 
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1 MAENLLDGPPNPKRAKLSSPGFSANDSTDFGSLFDLENDLPDEL I PNGGELGLLNSGNL V 

61 PDAASKHKQLSELLRGGSGSSINPGIGNVSASSPVQQGLG6QAQGQPNSANMASLSAMGK 

121 SPLSQGDSSAPSLPKQAASTSGPTPAASQALNPQAQKQVGLATSSPATSQTGPGICMNAN 

1B1 FNQTHPGLLNSNSGHSLINQASQGQAQVMNGSLGAAGRGRGAGMPYPTPAMQGASSSVLA 

241 ETLTQVSPQMTGHAGLNTAQAGGMAKMGITGNTSPFGQPFSQAGGQPMGATGVNPQLASK 

301 Q5MVNSLPTFPTDIKNTSVTNVPNMSQMQTSVGIVPTQAIATGPTADPEKRKLIQQQLVL 

361 LLHAHKCQRREQANGEVRACSLPHCRTMKNVLNHMTHCQAGKACQ 

FIG.7A 
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ggccgcgccg cccgtgcccg gggctgtttt 
ctggacggac cgcccaaccc caaaagagcc 
gacagcacag attttggatc attgtttgac 
cccaatggag gagaattagg ccttttaaac 
aaacataaac aactgtcgga gcttctacga 
ataggaaatg tgagcgccag cagccccgtg 
cagccgaaca gtgctaacat ggccagcctc 
ggagattctt cagcccccag cctgcctaaa 
gctgcctccc aagcactgaa tccgcaagca 
cctgccacgt cacagactgg acctggtatc 
ccaggcctcc tcaatagtaa ctctggccat 
gcgcaagtca tgaatggatc tcttggggct 
taccctactc cagccatgca gggcgcctcg 
gtttccccgc aaatgactgg tcacgcggga 
aagatgggaa taactgggaa cacaagtcca 
cagccaatgg gagccactgg agtgaacccc 
agtttgccca ccttccctac agatatcaag 
tctcagatgc aaacatcagt gggaattgta 
gcagatcctg aaaaacgcaa actgatacag 
aagtgtcaga gacgagagca agcaaacgga 
cgaaccatga aaaacgtttt gaatcacatg 
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1 MVDTESPLCPLSPLEAGDLESPLSEEFLQEMGNIQEISQSIGEDSSGSFGFTEYQYLGSC 
61 PGSDGSVITDTLSPASSPSSVTYPVVPGSVDESPSGALNIECRICGDKASGYHYGVHACE 
121 GCKGFFRRTIRLKLVYDKCDRSCKIQKKNRNKCQYCRFHKCLSVGMSHNAIRFGRMPRSE 
181 KAKLKAEILTCEHDIEDSETADLKSLAKRI YEAYLKNFNMNKVKARV ILSGKASNNPPFV 
241 IHDMETLCMAEKTLVAKLVANGIQNKEVEVRIFHCCQCTSVETVTELTEFAKAIPAFANL 
301 DLNDQVTLLKYGVYEAIFAMLSSVMNKDGMLVAYGNGFITREFLKSURKPFCDIMEPKFD 
361 FAMKFNALELDDSDISLFVAAI ICCGDRPGLLNVGHIEKMQEGIVHVURLHLQSNHPDDI 
421 FL PKLLQKMADLRQLVTEHAQLVQI I KKTESDAALHPLLQE I YRDMY 

FIG.8A 

1 ggcccaggct gaagctcagg gccctgtctg ctctgtggac tcaacagttt gtggcaagac 
61 aagctcagaa ctgagaagct gtcaccacag ttctggaggc tgggaagttc aagatcaaag 
121 tgccagcaga ttcagtgtca tgtgaggacg tgcttcctgc ttcatagata agagtagctt 
181 ggagctcggc ggcacaacca gcaccatctg gtcgcgatgg tggacacgga aagcccactc 
241 tgccccctct ccccactcga ggccggcgat ctagagagcc cgttatctga agagttcctg 
301 caagaaatgg gaaacatcca agagatttcg caatccatcg gcgaggatag ttctggaagc 
361 tttggcttta cggaatacca gtatttagga agctgtcctg gctcagatgg ctcggtcatc 
421 acggacacgc tttcaccagc ttcgagcccc tcctcggtga cttatcctgt ggtccccggc 
481 agcgtggacg agtctcccag tggagcattg aacatcgaat gtagaatctg cggggacaag 
541 gcctcaggct atcattacgg agtccacgcg tgtgaaggct gcaagggctt ctttcggcga 
601 acgattcgac tcaagctggt gtatgacaag tgcgaccgca gctgcaagat ccagaaaaag 
661 aacagaaaca aatgccagta ttgtcgattt cacaagtgcc tttctgtcgg gatgtcacac 
721 aacgcgattc gttttggacg aatgccaaga tctgagaaag caaaactgaa agcagaaatt 
781 cttacctgtg aacatgacat agaagattct gaaactgcag atctcaaatc tctggccaag 
841 agaatctacg aggcctactt gaagaacttc aacatgaaca aggtcaaagc ccgggtcatc 
901 ctctcaggaa aggccagtaa caatccacct tttgtcatac atgatatgga gacactgtgt 
961 atggctgaga agacgctggt ggccaagctg gtggccaatg gcatccagaa caaggaggtg 
1021 gaggtccgca tctttcactg ctgccagtgc acgtcagtgg agaccgtcac ggagctcacg 
1081 gaattcgcca aggccatccc agcgttcgca aacttggacc tgaacgatca agtgacattg 
1141 ctaaaatacg gagtttatga ggccatattc gccatgctgt cttctgtgat gaacaaagac 
1201 gggatgctgg tagcgtatgg aaatgggttt ataactcgtg aattcctaaa aagcctaagg 
1261 aaaccgttct gtgatatcat ggaacccaag tttgattttg ccatgaagtt caatgcactg 
1321 gaactggatg acagtgatat ctcccttttt gtggctgcta tcatttgctg tggagatcgt 
1381 cctggccttc taaacgtagg acacattgaa aaaatgcagg agggtattgt acatgtgctc 
1441 agactccacc tgcagagcaa ccacccggac gatatctttc tcttcccaaa acttcttcaa 
1501 aaaatggcag acctccggca gctggtgacg gagcatgcgc agctggtgca gatcatcaag 
1561 aagacggagt cggatgctgc gctgcacccg ctactgcagg agatctacag ggacatgtac 
1621 tgagttcctt cagatcagcc acaccttttc caggagttct gaagctgaca gcactacaaa 
1681 ggagacgggg gagcagcacg attttgcaca aatatccacc actttaacct tagagcttgg 
1741 acagtctgag ctgtaggtaa ccggcatatt attccatatc tttgttttaa ccagtacttc 
1801 taagagcata gaactcaaat gctgggggag gtggctaatc tcaggactgg gaag 
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1 MTMVDTE I AFWPTNFG I SSVDLSVMEDH5HSFD I KPFTTVDFSS I STPHYED I PFTRTDP 
61 VVADYKYDLKLQEYQSAIKVEPASPPYYSEKTQLYNKPHEEPSNSLMAIECRVCGDKASG 
121 FHYGVHACEGCKGFFRRTI RLKL I YDRCDLNCR I HKKSRNKCQYCRFQKCLAVGMSHNAI 
181 RFGRIAQAEKEKLLAEISSDIDQLNPESADLRQALAKHLYDSYIKSFPLTKAKARAILTG 
241 KTTDKSPFVI YDMNSLMMGEDKIKFKHITPLQEQSKEVAIRIFQGCQFRSVEAVQEITEY 
301 AKSI PGFVNLDLNDQVTLLKYGVHEI I YTMLASLMNKDGVLISEGQGFMTREFLKSLRKP 
361 FGDFMEPKFEFAVKFNALELDDSDLAIFIAVI ILSGDRPGLLNVKPIEDIQDNLLQALEL 
421 QLKLNHPESSQUAKLLQKMTDLRQIVTEHVQLLQV1KKTETDMSLHPLLQEIYKDLY 

FIG.9A 



1 ccgaccttac cccaggcggc cttgacgttg gtcttgtcgg caggagacag caccatggtg 

61 ggttctctct gagtctggga attcccgagc ccgagccgca gccgccgcct ggggggcttg 

121 ggtcggcctc gaggacaccg gagaggggcg ccacgccgcc gtggccgcag aaatgaccat 

181 ggttgacaca gagatcgcat tctggcccac caactttggg atcagctccg tggatctctc 

241 cgtaatggaa gaccactccc actcctttga tatcaagccc ttcactactg ttgacttctc 

301 cagcatttct actccacatt acgaagacat tccattcaca agaacagatc cagtggttgc 

361 agattacaag tatgacctga aacttcaaga gtaccaaagt gcaatcaaag tggagcctgc 

421 atctccacct tattattctg agaagactca gctctacaat aagcctcatg aagagccttc 

481 caactccctc atggcaattg aatgtcgtgt ctgtggagat aaagcttctg gatttcacta 

541 tggagttcat gcttgtgaag gatgcaaggg tttcttccgg agaacaatca gattgaagct 

601 tatctatgac agatgtgatc ttaactgtcg gatccacaaa aaaagtagaa ataaatgtca 

661 gtactgtcgg tttcagaaat gccttgcagt ggggatgtct cataatgcca tcaggtttgg 

721 gcggatcgca caggccgaga aggagaagct gttggcggag atctccagtg atatcgacca 

781 gctgaatcca gagtccgctg acctccgtca ggccctggca aaacatttgt atgactcata 

841 cataaagtcc ttcccgctga ccaaagcaaa ggcgagggcg atcttgacag gaaagacaac 

901 agacaaatca ccattcgtta tctatgacat gaattcctta atgatgggag aagataaaat 

961 caagttcaaa cacatcaccc ccctgcagga gcagagcaaa gaggtggcca tccgcatctt 

1021 tcagggctgc cagtttcgct ccgtggaggc tgtgcaggag atcacagagt atgccaaaag 

1081 cattcctggt tttgtaaatc ttgacttgaa cgaccaagta actctcctca aatatggagt 

1141 ccacgagatc atttacacaa tgctggcctc cttgatgaat aaagatgggg ttctcatatc 

1201 cgagggccaa ggcttcatga caagggagtt tctaaagagc ctgcgaaagc cttttggtga 

1261 ctttatggag cccaagtttg agtttgctgt'gaagttcaat gcactggaat tagatgacag 

1321 cgacttggca atatttattg ctgtcattat tctcagtgga gaccgcccag gtttgctgaa 

1381 tgtgaagccc attgaagaca ttcaagacaa cctgctacaa gccctggagc tccagctgaa 

1441 gctgaaccac cctgagtcct cacagctgtt tgccaagctg ctccagaaaa tgacagacct 

1501 cagacagatt gtcacggaac acgtgcagct actgcaggtg atcaagaaga cggagacaga 

1561 catgagtctt cacccgctcc tgcaggagat ctacaaggac ttgtactagc agagagtcct 

1621 gagccactgc caacatttcc cttcttccag ttgcactatt ctgagggaaa atctgaccat 

1681 aagaaattta ctgtgaaaaa gcgttttaaa aagaaaaggg tttagaatat gatctatttt 

1741 atgcatattg tttataaaga cacatttaca atttactttt aatattaaaa attaccatat 

1801 tatgaaattg c 
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1 MEQPQEEAPEVREEEEKEEVAEAEGAPELNGGPQHALPSSSY7DLSRSSSPPSLLDQLQM 

61 GCPGASCGSLNMECRVCGDKASGFHYGVHACEGCKGFFRRTIRMKLEYEKCERSCKIQKK 

121 NRNKCQYCRFQKCLALGMSHNAIRFGRMPEAEKRKLVAGLTANEGSQYNPQVADLKAFSK 

181 H I YNAYLKNFNMTKKKARS I LTGKASHTAPFV I HDI ETLWQAE KGLVWKQLVNGLPP YKE 

241 ISVHVFYRCQCTTVETVRELTEFAKSIPSFSSLFLNDQVTLLKYGVHEAIFAMLASIVNK 

301 DGLLVANGSGFVTREFLRSLRKPFSDI IEPKFEFAVKFNALELDD5DLALFIAAI ILCGD 

361 RPGLMNVPRVEAIQDTILRALEFHLQANHPDAQYLFPKLLQKMADLRQLVTEHAQMMQRI 

421 KKTETETSLHPLLQEIYKDMY 



FIG.10A 
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1 gaattctgcg gagcctgcgg gacggcggcg ggttggcccg taggcagccg ggacagtgtt 

61 gtacagtgtt ttgggcatgc acgtgatact cacacagtgg cttctgctca ccaacagatg 

121 aagacagatg caccaacgag ggtctggaat ggtctggagt ggtctggaaa gcagggtcag 

181 atacccctgg aaaactgaag cccgtggagc aatgatctct acaggactgc ttcaaggctg 

241 atgggaacca ccctgtagag gtccatctgc gttcagaccc agacgatgcc agagctatga 

301 ctgggcctgc aggtgtggcg ccgaggggag atcagccatg gagcagccac aggaggaagc 

361 ccctgaggtc cgggaagagg aggagaaaga ggaagtggca gaggcagaag gagccccaga 

421 gctcaatggg ggaccacagc atgcacttcc ttccagcagc tacacagacc tctcccggag 

481 ctcctcgcca ccctcactgc tggaccaact gcagatgggc tgtgacgggg cctcatgcgg 

541 cagcctcaac atggagtgcc gggtgtgcgg ggacaaggca tcgggcttcc actacggtgt 

601 tcatgcatgt gaggggtgca agggcttctt ccgtcgtacg atccgcatga agctggagta 

661 cgagaagtgt gagcgcagct gcaagattca gaagaagaac cgcaacaagt gccagtactg 

721 ccgcttccag aagtgcctgg cactgggcat gtcacacaac gctatccgtt ttggtcggat 

781 gccggaggct gagaagagga agctggtggc agggctgact gcaaacgagg ggagccagta 

841 caacccacag gtggccgacc tgaaggcctt ctccaagcac atctacaatg cctacctgaa 

901 aaacttcaac atgaccaaaa agaaggcccg cagcatcctc accggcaaag ccagccacac 

961 ggcgcccttt gtgatccacg acatcgagac attgtggcag gcagagaagg ggctggtgtg 

1021 gaagcagttg gtgaatggcc tgcctcccta caaggagatc agcgtgcacg tcttctaccg 

1081 ctgccagtgc accacagtgg agaccgtgcg ggagctcact gagttcgcca agagcatccc 

1141 cagcttcagc agcctcttcc tcaacgacca ggttaccctt ctcaagtatg gcgtgcacga 

1201 ggccatcttc gccatgctgg cctctatcgt caacaaggac gggctgctgg tagccaacgg 

1261 cagtggcttt gtcacccgtg agttcctgcg cagcctccgc aaacccttca gtgatatcat 

1321 tgagcctaag tttgaatttg ctgtcaagtt caacgccctg gaacttgatg acagtgacct 

1381 ggccctattc attgcggcca tcattctgtg tggagaccgg ccaggcctca tgaacgttcc 

1441 acgggtggag gctatccagg acaccatcct gcgtgccctc gaattccacc tgcaggccaa 

1501 ccaccctgat gcccagtacc tcttccccaa gctgctgcag aagatggctg acctgcggca 

1561 actggtcacc gagcacgccc agatgatgca gcggatcaag aagaccgaaa ccpagacctc 

1621 gctgcaccct ctgctccagg agatctacaa ggacatgtac taacggcggc acccaggcct 

1681 ccctgcagac tccaatgggg ccagcactgg aggggcccac ccacatgact tttccattga 

1741 ccagctctct tcctgtcttt gttgtctccc tctttctcag ttcctctttc ttttctaatt 

1801 cctgttgctc tgtttcttcc tttctgtagg tttctctctt cccttctccc ttctcccttg 

1861 ccctcccttt ctctctccta tccccacgtc tgtcctcctt tcttattctg tgagatgttt 

1921 tgtattattt caccagcagc atagaacagg acctctgctt ttgcacacct tttccccagg 

1981 agcagaagag agtgggcctg ccctctgccc catcattgca cctgcaggct taggtcctca 

2041 cttctgtctc ctgtcttcag agcaaaagac ttgagccatc caaagaaaca ctaagctctc 

2101 tgggcctggg ttccagggaa ggctaagcat ggcctggact gactgcagcc ccctatagtc 

2161 atggggtccc tgctgcaaag gacagtggca gaccccggca gtagagccga gatgcctccc 

2221 caagactgtc attgcccctc cgatcgtgag gccacccact gacccaatga tcctctccag 

2281 cagcacacct cagccccact gacacccagt gtccttccat cttcacactg gtttgccagg 

2341 ccaatgttgc tgatggcccc tccagcacac acacataagc actgaaatca ctttacctgc 

2401 aggcaccatg cacctccctt ccctccctga ggcaggtgag aacccagaga gaggggcctg 
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2461 caggtgagca ggcagggctg ggccaggtct 
2521 tgggtcagcc cagcacctcg cccagtggga 
2581 ctgatgtcca tttgtcccaa tagctctact 
2641 tggccaccta gaagtctccc tgcacagcct 
2701 ccacaccgct ggtccctgcc ctcccctgct 
2761 gggccaaagc acagctgggc atgccatgtc 
2821 gggcaagggg ctggctggag tctcagagca 
2881 ggcttcctgg gtcctgcctg gtcctccctc 
2941 gaagtgccca gcccctgcct gacggnnnnn 
3001 cgctccccac ctacccagct gatgggggtt 
3061 ggacagctgc tgggacccac ctcccccctt 
3121 cccgggtctg gtgctgagga tacagctctt 
3181 atgtatattt ttgctaggag ccccagcttc 
3241 actgacgaaa ctttaaataa atgggaatta 
3301 c 



ccggggaggc aggggtcctg caggtcctgg 
gcttcccggg ataaactgag cctgttcatt 
gccctcccct tcccctttac tcagcccagc 
ctagtgtccg gggaccttgt gggaccagtc 
cccaggttga ggtgcgctca cctcagagca 
tgagcggcgc agagccctcc aggcctgcag 
cagaggtagg agaactgggg ttcaagccca 
ccaaggagcc attctatgtg actctgggtg 
nngatcactc tctgctggca ggattcttcc 
ggggtgcttc tttcagccaa ggctatgaag 
ccccggccac atgccgcgtc cctgccccca 
ctcagtgtct gaacaatctc caaaattgaa 
ctgtgttttt aatataaata gtgtacacag 
aatatttaaa aaaaaaagcg gccgcgaatt 
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(D) SOFTWARE: FastSEQ for Windows Version 2.0b 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 1 : 



Met 


Ala 


Glu 


Asn 


Leu 


Leu 


Asp 


Gly 


Pro 


Pro 


Asn 


Pro 


Lys 


Arg 


Ala 


Lys 


1 








5 










10 








15 


Leu 


Ser 


Ser 


Pro 


Gly 


Phe 


Ser 


Ala 


Asn 


Asp 


Ser 


Thr 


Asp 


Phe 


Gly 


Ser 








20 










25 








30 




Leu 


Phe 


Asp 


Leu 


Glu 


Asn 


Asp 


Leu 


Pro 


Asp 


Glu 


Leu 


He 


Pro 


Asn 


Gly 






35 










40 










45 






Gly 


Glu 


Leu 


Gly 


Leu 


Leu 


Asn 


Ser Gly 


Asn 


Leu 


Val 


Pro 


Asp 


Ala 


Ala 




50 










55 










60 








Ser 


Lys 


His 


Lys 


Gin 


Leu 


Ser 


Glu 


Leu 


Leu 


Arg Gly Gly 


Ser 


Gly 


Ser 


65 










70 










75 








80 


Ser 


lie 


Asn 


Pro 


Gly 
85 


He 


Gly 


Asn 


Val 


Ser 
90 


Ala 


Ser 


Ser 


Pro 


Val 
95 


Gin 


Gin 


Gly 


Leu Gly 


Gly 


Gin 


Ala 


Gin 


Gly 


Gin 


Pro 


Asn 


Ser 


Ala 


Asn 


Met 








100 










105 










110 






Ala 


Ser 


Leu 


Ser 


Ala 


Met 


Gly 


Lys 


Ser 


Pro 


Leu 


Ser 


Gin 


Gly 


Asp 


Ser 






115 










120 










125 




Ser 


Ala 


Pro 


Ser 


Leu 


Pro 


Lys 


Gin 


Ala 


Ala 


Ser 


Thr 


Ser 


Gly 


Pro 


Thr 




13 0 










135 










140 








Pro 


Ala 


Ala 


Ser 


Gin 


Ala 


Leu 


Asn 


Pro 


Gin 


Ala 


Gin 


Lys 


Gin 


Val 


Gly 


14b 










150 










155 








160 


Leu 


Ala 


Thr 


Ser 


Ser 


Pro 


Ala 


Thr 


Ser 


Gin 


Thr Gly 


Pro 


Gly 


lie 


Cys 










165 










170 










175 


Met 


Asn 


Ala 


Asn 


Phe 


Asn 


Gin 


Thr 


His 


Pro Gly 


Leu 


Leu 


Asn 


Ser 


Asn 








180 










185 










190 








Gly 


His 


Ser 


Leu 


He 


Asn 


Gin 


Ala 


Ser Gin Gly Gin 


Ala 


Gin 


Val 






195 










200 










205 








Met 


Asn 


Gly Ser 


Leu 


Gly 


Ala 


Ala Gly 


Arg Gly Arg Gly 


Ala Gly Met 




210 










215 










220 










Pro 


Tyr 


Pro 


Thr 


Pro 


Ala 


Met 


Gin 


Gly 


Ala 


Ser 


Ser 


Ser 


Val 


Leu 


Ala 


225 










230 










235 










240 


Glu 


Thr 


Leu 


Thr 


Gin 
245 


Val 


Ser 


Pro 


Gin 


Met 
250 


Thr 


Gly 


His 


Ala 


Gly 
255 


Leu 


Asn 


Thr 


Ala 


Gin 


Ala 


Gly 


Gly 


Met 


Ala 


Lys 


Met 


Gly 


He 


Thr 


Gly 


Asn 








260 










265 








270 




Thr 


Ser 


Pro 


Phe 


Gly 


Gin 


Pro 


Phe 


Ser 


Gin 


Ala Gly Gly 


Gin 


Pro 


Met 






275 










280 










285 








Gly 


Ala 


Thr Gly 


Val 


Asn 


Pro 


Gin 


Leu 


Ala 


Ser 


Lys 


Gin 


Ser 


Met 


Val 




290 










295 










300 










Asn 


Ser 


Leu 


Pro 


Thr 


Phe 


Pro 


Thr 


Asp 


He 


Lys 


Asn 


Thr 


Ser 


Val 


Thr 


305 










310 










315 










320 


Asn 


Val 


Pro 


Asn 


Met 


Ser 


Gin 


Met 


Gin 


Thr 


Ser 


Val 


Gly 


He 


Val 


Pro 










325 










33Q 








335 




Thr 


Gin 


Ala 


He 


Ala 


Thr 


Gly 


Pro 


Thr 


Ala 


Asp 


Pro 


Glu 


Lys 


Arg 


Lys 








340 










345 










350 


Leu 


He 


Gin 


Gin 


Gin 


Leu 


Val 


Leu 


Leu 


Leu 


His 


Ala 


His 


Lys 


Cys 


Gin 






355 










360 










365 




Arg Arg Glu Gin 


Ala 


Asn 


Gly 


Glu 


Val 


Arg 


Ala 


Cys 


Ser 


Leu 


Pro 


His 




370 










375 










380 










Cys 


Arg 


Thr 


Met 


Lys 


Asn 


Val 


Leu 


Asn 


His 


Met 


Thr 


His 


Cys 


Gin 


Ala 


385 










390 










395 








400 


Gly Lys 


Ala 


Cys 


Gin 























405 



-2 
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(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Val Asp Thr Glu Ser Pro Leu Cys Pro Leu Ser Pro Leu Glu Ala 

1 5 io 15 

Gly Asp Leu Glu Ser Pro Leu Ser Glu Glu Phe Leu Gin Glu Met Gly 

20 25 30 

Asn He Gin Glu He Ser Gin Ser He Gly Glu Asp Ser Ser Gly Ser 

35 40 45 

Phe Gly Phe Thr Glu Tyr Gin Tyr Leu Gly Ser Cys Pro Gly Ser Asp 
50 55 60 



60 
120 
180 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

CGAGCCCCGA CCCCCGTCCG GGCCCTCGCC GGCCGCGCCG CCCGTGCCCG GGGCTGTTTT 
CCCGAGCAGG TGAAAATGGC TGAGAACTTG CTGGACGGAC CGCCCAACCC CAAAAGAGCC 

AAACTCAGCT CGCCCGGTTT CTCGGCGAAT GACAGCACAG ATTTTGGATC ATTGTTTGAC icsu 

TTGGAAAATG ATCTTCCTGA TGAGCTGATA CCCAATGGAG GAGAATTAGG CCTTTTAAAC 240 

AGTGGGAACC TTGTTCCAGA TGCTGCTTCC AAACATAAAC AACTGTCGGA GCTTCTACGA 300 

GGAGGCAGCG GCTCTAGTAT CAACCCAGGA ATAGGAAATG TGAGCGCCAG CAGCCCCGTC 360 

CAGCAGGGCC TGGGTGGCCA GGCTCAAGGG CAGCCGAACA GTGCTAACAT GGCCAGCCTC 420 

AGTGCCATGG GCAAGAGCCC TCTGAGCCAG GGAGATTCTT CAGCCCCCAG CCTGCCTAAA 480 

CAGGCAGCCA GCACCTCTGG GCCCACCCCC GCTGCCTCCC AAGCACTGAA TCCGCAAGCA 540 

CAAAAGCAAG TGGGGCTGGC GACTAGCAGC CCTGCCACGT CACAGACTGG ACCTGGTATC 600 

TGCATGAATG CTAACTTTAA CC AGACCCAC CCAGGCCTCC TCAATAGTAA CTCTGGCCAT 660 

AGCTTAATTA ATCAGGCTTC ACAAGGGCAG GCGCAAGTCA TGAATGGATC TCTTGGGGCT 720 

GCTGGCAGAG GAAGGGGAGC TGGAATGCCG TACCCTACTC CAGCCATGCA GGGCGCCTCG 780 

AGCAGCGTGC TGGCTGAGAC CCTAACGCAG GTTTCCCCGC AAATGACTGG TCACGCGGGA 840 

CTGAACACCG CACAGGCAGG AGGCATGGCC AAGATGGGAA TAACTGGGAA CACAAGTCCA 900 

TTTGGACAGC CCTTTAGTCA AGCTGGAGGG CAGCCAATGG GAGCCACTGG AGTGAACCCC 960 

CAGTTAGCCA GCAAACAGAG CATGGTCAAC AGTTTGCCCA CCTTCCCTAC AGATATCAAG 1020 

AATACTTCAG TCACCAACGT GCCAAATATG TCTCAGATGC AAACATCAGT GGGAATTGTA 1080 

CCCACACAAG CAATTGCAAC AGGCCCCACT GCAGATCCTG AAAAACGCAA ACTGATACAG 1140 

CAGCAGCTGG TTCTACTGCT TCATGCTCAT AAGTGTCAGA GACGAGAGCA AGCAAACGGA 1200 

GAGGTTCGGG CCTGCTCGCT CCCGCATTGT CGAACCATGA AAAACGTTTT GAATCACATG 1260 
ACGCATTGTC AGGCTGGGAA AGCCTGCCAA 



1290 
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pi,, 

U ly 


Ser 


val 


He 


Thr 


Asp 


Thr 


Leu 


Ser 


Pro 


65 










70 










val 


Thr 


Tyr 


Pro 


Val 


Val 


Pro 


Gly 


Ser 


Val 










85 








90 


Ala 


Leu 


Asn 


He 


Glu 


Cys 


Arg 


He 


Cys 


Gly 








100 










105 


HIS 


Tyr 


Gly 


Val 


His 


Ala 


Cys 


Glu 


Gly 


Cys 






115 










120 






lnr 


He 


Arg 


Leu 


Lys 


Leu 


Val 


Tyr 


Asp 


Lys 




1 J 0 










135 








He 


Gin 


Lys 


Lys 


Asn 


Arg 


Asn 


Lys 


Cys 


Gin 


14b 










150 








Cys 


Leu 


Ser 


Val 


Gly 


Met 


Ser 


His 


Asn 


Ala 










165 










170 


Pro 


Arg 


Ser 


Glu 


Lys 


Ala 


Lys 


Leu 


Lys 


Ala 








180 










185 




Ml S 


Asp 


He 


Glu 


Asp 


Ser 


Glu 


Thr 


Ala 


Asp 






19 b 










200 




Arg 


He 


Tyr 


Glu 


Ala 


Tyr 


Leu 


Lys 


Asn 


Phe 




OTA 

210 










215 






Ala 


Arg 


Val 


He 


Leu 


Ser 


Gly 


Lys 


Ala 


Ser 












230 








He 


His 


Asp 


Met 


Glu 


Thr 


Leu 


Cys 


Met 


Ala 










245 










250 


Lys 


T , 

Leu 


Val 


Ala 


Asn 


Gly 


He 


Gin 


Asn 


Lys 








260 










265 


irne 


HIS 


Cys 


cys 


Gin 


Cys 


Thr 


Ser 


Val 


Glu 






275 










280 






LrlU 


rne 


Ala 


Lys 


Ala 


He 


Pro 


Ala 


Phe 


Ala 




T A A 










295 










Val 


Thr 


Leu 


Leu 


Lys 


Tyr 


Gly 


Val 


Tyr 












310 








Leu 


Ser 


Ser 


Val 


Met 


Asn 


Lys 


Asp 


Gly 


Met 










325 










330 


Gly 


Phe 


He 


Thr 


Arg 


Glu 


Phe 


Leu 


Lys 


Ser 








340 










345 




Asp 


He 


Met 


Glu 


Pro 


Lys 


Phe 


Asp 


Phe 


Ala 






355 










360 






blU 


Leu 


Asp 


Asp 


Ser 


Asp 


He 


Ser 


Leu 


Phe 




370 










375 








cys 


Gly 


Asp 


Arg 


Pro 


Gly 


Leu 


Leu 


Asn 


Val 


Job 










390 










ply, 

tain 


Glu Gly 


He 


Val 


His 


Val 


Leu 


Arg 


Leu 










405 










410 


Pro 


Asp 


Asp 


He 


Phe 


Leu 


Phe 


Pro 


Lys 


Leu 








420 










425 




Leu 


Arg 


Gin 


Leu 


Val 


Thr 


Glu 


His 


Ala 


Gin 






43 5 










440 






Lys 


Thr 


Glu 


Ser 


Asp 


Ala 


Ala 


Leu 


His 


Pro 




450 










455 









Arg Asp Met Tyr 
465 



(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 
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Ala 


Ser 


Ser 


Pro 


Ser 


Ser 


75 










80 


Asp 


Glu 


Ser 


Pro 


Ser 


Gly 










95 




Asp 


Lys 


Ala 


Ser 


Gly 


Tyr 








110 






Lys 


Gly 


Phe 


Phe 


Arg 


Arg 






125 








Cys 


Asp 


Arg 


Ser 


Cys 


Lys 




140 










Tyr 


Cys 


Arg 


Phe 


His 


Lys 


155 










160 


He 


Arg 


Phe 


Gly 


Arg 


Met 










175 




Glu 


He 


Leu 


Thr 


Cys 


Glu 








190 






Leu 


Lys 


Ser 


Leu 


Ala 


Lys 






205 








Asn 


Met 


Asn 


Lys 


Val 


Lys 




220 










Asn 


Asn 


Pro 


Pro 


Phe 


Val 


235 










240 


Glu 


Lys 


Thr 


Leu 


Val 


Ala 










255 




Glu 


Val 


Glu 


Val 


Arg 


He 








270 






Thr 


Val 


Thr 


Glu 


Leu 


Thr 






285 








Asn 


Leu 


Asp 


Leu 


Asn 


Asp 




300 










Glu 


Ala 


He 


Phe 


Ala 


Met 


315 










320 


Leu 


Val 


Ala 


Tyr 


Gly 


Asn 










335 




Leu 


Arg 


Lys 


Pro 


Phe 


Cys 








350 






Met 


Lys 


Phe 


Asn 


Ala 


Leu 






365 








Val 


Ala 


Ala 


He 


He 


Cys 




380 










Gly 


His 


He 


Glu 


Lys 


Met 


395 










400 


His 


Leu 


Gin 


Ser 


Asn 


His 










415 




Leu 


Gin 


Lys 


Met 


Ala 


Asp 








430 






Leu 


Val 


Gin 


He 


He 


Lys 






445 








Leu 


Leu 


Gin 


Glu 


He 


Tyr 



460 
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60 
120 
180 
240 



600 
660 



(A) LENGTH: 1854 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

GGCCCAGGCT GAAGCTCAGG GCCCTGTCTG CTCTGTGGAC TCAACAGTTT GTGGCAAGAC 
AAGCTCAGAA CTGAGAAGCT GTCACCACAG TTCTGGAGGC TGGGAAGTTC AAGATCAAAG 
TGCCAGCAGA TTCAGTGTCA TGTGAGGACG TGCTTCCTGC TTCATAGATA AGAGTAGCTT 
GGAGCTCGGC GGCACAACCA GCACCATCTG GTCGCGATGG TGGACACGGA AAGCCCACTC 

TGCCCCCTCT CCCCACTCGA GGCCGGCGAT CTAGAGAGCC CGTTATCTGA AGAGTTCCTG 3 00 

CAAGAAATGG GAAACATCCA AGAGATTTCG CAATCCATCG GCGAGGATAG TTCTGGAAGC 360 

TTTGGCTTTA CGGAATACCA GTATTTAGGA AGCTGTCCTG GCTCAGATGG CTCGGTCATC 420 

ACGGACACGC TTTCACCAGC TTCGAGCCCC TCCTCGGTGA CTTATCCTGT GGTCCCCGGC 480 

AGCGTGGACG AGTCTCCCAG TGGAGCATTG AACATCGAAT GTAGAATCTG CGGGGACAAG 540 
GCCTCAGGCT ATCATTACGG AGTCCACGCG TGTGAAGGCT GCAAGGGCTT CTTTCGGCGA 
ACGATTCGAC TCAAGCTGGT GTATGACAAG TGCGACCGCA GCTGCAAGAT CCAGAAAAAG 

AACAGAAACA AATGCCAGTA TTGTCGATTT CACAAGTGCC TTTCTGTCGG GATGTCACAC 720 

AACGCGATTC GTTTTGGACG AATGCCAAGA TCTGAGAAAG CAAAACTGAA AGCAGAAATT 780 

CTTACCTGTG AACATGACAT AGAAGATTCT GAAACTGCAG ATCTCAAATC TCTGGCCAAG 840 

AGAATCTACG AGGCCTACTT GAAGAACTTC AACATGAACA AGGTCAAAGC CCGGGTCATC 900 

CTCTCAGGAA AGGCCAGTAA CAATCCACCT TTTGTCATAC ATGATATGGA GACACTGTGT 960 

ATGGCTGAGA AGACGCTGGT GGCCAAGCTG GTGGCCAATG GCATCCAGAA CAAGGAGGTG 1020 
GAGGTCCGCA TCTTTCACTG CTGCCAGTGC ACGTCAGTGG AGACCGTCAC GGAGCTCACG 
GAATTCGCCA AGGCCATCCC AGCGTTCGCA AACTTGGACC TGAACGATCA AGTGACATTG 

CTAAAATACG GAGTTTATGA GGCCATATTC GCCATGCTGT CTTCTGTGAT GAACAAAGAC A<4 uu 

GGGATGCTGG TAGCGTATGG AAATGGGTTT ATAACTCGTG AATTCCTAAA AAGCCTAAGG 1260 

AAACCGTTCT GTGATATCAT GGAACCCAAG TTTGATTTTG CCATGAAGTT CAATGCACTG 1320 

GAACTGGATG ACAGTGATAT CTCCCTTTTT GTGGCTGCTA TCATTTGCTG TGGAGATCGT 13 80 

CCTGGCCTTC TAAACGTAGG ACACATTGAA AAAATGCAGG AGGGTATTGT ACATGTGCTC 1440 

AGACTCCACC TGCAGAGCAA CCACCCGGAC GATATCTTTC TCTTCCCAAA ACTTCTTCAA 1500 

AAAATGGCAG ACCTCCGGCA GCTGGTGACG GAGCATGCGC AGCTGGTGCA GATCATCAAG 1560 

AAGACGGAGT CGGATGCTGC GCTGCACCCG CTACTGCAGG AGATCTACAG GGACATGTAC 1620 

TGAGTTCCTT CAGATCAGCC ACACCTTTTC CAGGAGTTCT GAAGCTGACA GCACTACAAA 1680 

GGAGACGGGG GAGCAGCACG ATTTTGCACA AATATCCACC ACTTTAACCT TAGAGCTTGG 1740 

ACAGTCTGAG CTGTAGGTAA CCGGCATATT ATTCCATATC TTTGTTTTAA CCAGTACTTC 1800 

TAAGAGCATA GAACTCAAAT GCTGGGGGAG GTGGCTAATC TCAGGACTGG GAAG 1854 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Thr Met Val Asp Thr Glu He Ala Phe Trp Pro Thr Asn Phe Gly 
1 5 io 15 



1080 
1140 

1200 
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lie Ser Ser Val 
20 

Asp lie Lys Pro 
35 

His Tyr Glu Asp 
50 

Tyr Lys Tyr Asp 
65 

Glu Pro Ala Ser 

Lys Pro His Glu 
100 

Val Cys Gly Asp 
115 

Glu Gly Cys Lys 
130 

Tyr Asp Arg Cys 
145 

Lys Cys Gin Tyr 

His Asn Ala lie 
180 

Leu Leu Ala Glu 
195 

Ala Asp Leu Arg 
210 

Lys Ser Phe Pro 
225 

Lys Thr Thr Asp 

Met Met Gly Glu 
260 

Glu Gin Ser Lys 
275 

Arg Ser Val Glu 
290 

Pro Gly Phe Val 
305 

Tyr Gly Val His 

Lys Asp Gly Val 
340 

Phe Leu Lys Ser 
355 

Phe Glu Phe Ala 

370 

Leu Ala He Phe 
385 

Leu Leu Asn Val 

Ala Leu Glu Leu 
420 

Phe Ala Lys Leu 
435 

Glu His Val Gin 
450 



Asp Leu Ser Val 

Phe Thr Thr Val 
40 

He Pro Phe Thr 
55 

Leu Lys Leu Gin 
70 

Pro Pro Tyr Tyr 
85 

Glu Pro Ser Asn 

Lys Ala Ser Gly 
120 

Gly Phe Phe Arg 
135 

Asp Leu Asn Cys 
150 

Cys Arg Phe Gin 
165 

Arg Phe Gly Arg 

He Ser Ser Asp 
200 

Gin Ala Leu Ala 
215 

Leu Thr Lys Ala 
230 

Lys Ser Pro Phe 
245 

Asp Lys He Lys 

Glu Val Ala He 
280 

Ala Val Gin Glu 
295 

Asn Leu Asp Leu 

310 

Glu He He Tyr 
325 

Leu He Ser Glu 

Leu Arg Lys Pro 
360 

Val Lys Phe Asn 

375 

He Ala Val He 
390 

Lys Pro He Glu 
405 

Gin Leu Lys Leu 

Leu Gin Lys Met 
440 

Leu Leu Gin Val 
455 



Met Glu Asp His 
25 

Asp Phe Ser Ser 

Arg Thr Asp Pro 
60 

Glu Tyr Gin Ser 
75 

Ser Glu Lys Thr 
90 

Ser Leu Met Ala 
105 

Phe His Tyr Gly 

Arg Thr He Arg 
140 

Arg He His Lys 
155 

Lys Cys Leu Ala 
170 

He Ala Gin Ala 
185 

He Asp Gin Leu 

Lys His Leu Tyr 
220 

Lys Ala Arg Ala 
235 

Val He Tyr Asp 
250 

Phe Lys His He 
265 

Arg He Phe Gin 

He Thr Glu Tyr 
300 

Asn Asp Gin Val 
315 

Thr Met Leu Ala 
330 

Gly Gin Gly Phe 
345 

Phe Gly Asp Phe 

Ala Leu Glu Leu 
380 

lie Leu Ser Gly 
395 

Asp He Gin Asp 
410 

Asn His Pro Glu 
425 

Thr Asp Leu Arg 

He Lys Lys Thr 
460 



Ser His Ser Phe 

30 

He Ser Thr Pro 
45 

Val Val Ala Asp 

Ala He Lys Val 
80 

Gin Leu Tyr Asn 
95 

He Glu Cys Arg 
110 

Val His Ala Cys 
125 

Leu Lys Leu He 

Lys Ser Arg Asn 
160 

Val Gly Met Ser 
175 

Glu Lys Glu Lys 
190 

Asn Pro Glu Ser 
205 

Asp Ser Tyr He 

He Leu Thr Gly 
240 

Met Asn Ser Leu 
255 

Thr Pro Leu Gin 
270 

Gly Cys Gin Phe 
285 

Ala Lys Ser He 

Thr Leu Leu Lys 
320 

Ser Leu Met Asn 
335 

Met Thr Arg Glu 
350 

Met Glu Pro Lys 
365 

Asp Asp Ser Asp 

Asp Arg Pro Gly 
400 

Asn Leu Leu Gin 
415 

Ser Ser Gin Leu 
430 

Gin lie Val Thr 
445 

Glu Thr Asp Met 
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60 



Ser Leu His Pro Leu Leu Gin Glu lie Tyr Lys Asp Leu Tyr 
465 470 475 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

CCGACCTTAC CCCAGGCGGC CTTGACGTTG GTCTTGTCGG CAGGAGACAG CACCATGGTG 

GGTTCTCTCT GAGTCTGGGA ATTCCCGAGC CCGAGCCGCA GCCGCCGCCT GGGGGGCTTG 120 

GGTCGGCCTC GAGGACACCG GAGAGGGGCG CCACGCCGCC GTGGCCGCAG AAATGACCAT 180 

GGTTGACACA GAGATCGCAT TCTGGCCCAC CAACTTTGGG ATCAGCTCCG TGGATCTCTC 240 

CGTAATGGAA GACCACTCCC ACTCCTTTGA TATCAAGCCC TTCACTACTG TTGACTTCTC 300 

CAGCATTTCT ACTCCACATT ACGAAGACAT TCCATTCACA AGAACAGATC CAGTGGTTGC 360 

AGATTACAAG TATGACCTGA AACTTCAAGA GTACCAAAGT GCAATCAAAG TGGAGCCTGC 420 

ATCTCCACCT TATTATTCTG AGAAGACTCA GCTCTACAAT AAGCCTCATG AAGAGCCTTC 480 

CAACTCCCTC ATGGCAATTG AATGTCGTGT CTGTGGAGAT AAAGCTTCTG GATTTCACTA 540 

TGGAGTTCAT GCTTGTGAAG GATGCAAGGG TTTCTTCCGG AGAACAATCA GATTGAAGCT 600 

TATCTATGAC AGATGTGATC TTAACTGTCG GATCCACAAA AAAAGTAGAA ATAAATGTCA 660 

GTACTGTCGG TTTCAGAAAT GCCTTGCAGT GGGGATGTCT CATAATGCCA TCAGGTTTGG 720 

GCGGATCGCA CAGGCCGAGA AGGAGAAGCT GTTGGCGGAG ATCTCCAGTG ATATCGACCA 780 

GCTGAATCCA GAGTCCGCTG ACCTCCGTCA GGCCCTGGCA AAACATTTGT ATGACTCATA 840 

CATAAAGTCC TTCCCGCTGA CCAAAGCAAA GGCGAGGGCG ATCTTGACAG GAAAGACAAC 900 

AGACAAATCA CCATTCGTTA TCTATGACAT GAATTCCTTA ATGATGGGAG AAGATAAAAT 960 

CAAGTTCAAA CACATCACCC CCCTGCAGGA GCAGAGCAAA GAGGTGGCCA TCCGCATCTT 1020 

TCAGGGCTGC CAGTTTCGCT CCGTGGAGGC TGTGCAGGAG ATCACAGAGT ATGCCAAAAG 1080 

CATTCCTGGT TTTGTAAATC TTGACTTGAA CGACCAAGTA ACTCTCCTCA AATATGGAGT 1140 

CCACGAGATC ATTTACACAA TGCTGGCCTC CTTGATGAAT AAAGATGGGG TTCTCATATC 1200 

CGAGGGCCAA GGCTTCATGA CAAGGGAGTT TCTAAAGAGC CTGCGAAAGC CTTTTGGTGA 1260 

CTTTATGGAG CCCAAGTTTG AGTTTGCTGT GAAGTTCAAT GCACTGGAAT TAGATGACAG 1320 

CGACTTGGCA ATATTTATTG CTGTCATTAT TCTCAGTGGA GACCGCCCAG GTTTGCTGAA 1380 

TGTGAAGCCC ATTGAAGACA TTCAAGACAA CCTGCTACAA GCCCTGGAGC TCCAGCTGAA 1440 

GCTGAACCAC CCTGAGTCCT CACAGCTGTT TGCCAAGCTG CTCCAGAAAA TGACAGACCT 1500 

CAGACAGATT GTCACGGAAC ACGTGCAGCT ACTGCAGGTG ATCAAGAAGA CGGAGACAGA 1560 

CATGAGTCTT CACCCGCTCC TGCAGGAGAT CTACAAGGAC TTGTACTAGC AGAGAGTCCT 1620 

GAGCCACTGC CAACATTTCC CTTCTTCCAG TTGCACTATT CTGAGGGAAA ATCTGACCAT 1680 

AAGAAATTTA CTGTGAAAAA GCGTTTTAAA AAGAAAAGGG TTTAGAATAT GATCTATTTT 1740 

ATGCATATTG TTTATAAAGA CACATTTACA ATTTACTTTT AATATTAAAA ATTACCATAT 1800 

TATGAAATTG C 1811 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 
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(xi) SEQUENCE DESCRIPTION 



Met 


Glu 


Gin 


Pro 


Gin 


Glu 


Glu 


Ala 


1 








5 








Lys 


Glu 


Glu 


Val 


Ala 


Glu 


Ala 


Glu 








20 










Pro 


Gin 


His 


Ala 


Leu 


Pro 


Ser 


Ser 






35 










40 


Ser 


Ser 


Pro 


Pro 


Ser 


Leu 


Leu 


Asp 




50 










55 




Ala 


Ser 


Cys 


Gly 


Ser 


Leu 


Asn 


Met 


65 










70 






Ala 


Ser 


Gly 


Phe 


His 


Tyr 


Gly 


Val 










85 








Phe 


Phe 


Arg 


Arg 


Thr 


He 


Arg 


Met 








100 










Arg 


Ser 


Cys 


Lys 


lie 


Gin 


Lys 


Lys 






115 










120 


Arg 


Phe 


Gin 


Lys 


Cys 


Leu 


Ala 


Leu 




130 










135 




Phe 


Gly 


Arg 


Met 


Pro 


Glu 


Ala 


Glu 


145 










150 






Thr 


Ala 


Asn 


Glu 


Gly 


Ser 


Gin 


Tyr 










165 








Ala 


Phe 


Ser 


Lys 


His 


He 


Tyr 


Asn 








180 










Thr 


Lys 


Lys 


Lys 


Ala 


Arg 


Ser 


He 






195 










200 


Ala 


Pro 


Phe 


Val 


He 


His 


Asp 


He 




210 










215 




Gly 


Leu 


Val 


Trp 


Lys 


Gin 


Leu 


Val 


225 










230 






He 


Ser 


Val 


His 


Val 


Phe 


Tyr 


Arg 










245 








Val 


Arg 


Glu 


Leu 


Thr 


Glu 


Phe 


Ala 








260 










Leu 


Phe 


Leu 


Asn 


Asp 


Gin 


Val 


Thr 






275 










280 


Ala 


He 


Phe 


Ala 


Met 


Leu 


Ala 


Ser 




290 










295 




Val 


Ala 


Asn 


Gly 


Ser 


Gly 


Phe 


Val 


305 










310 






Arg 


Lys 


Pro 


Phe 


Ser 


Asp 


He 


He 










325 








Lys 


Phe 


Asn 


Ala 


Leu 


Glu 


Leu 


Asp 








340 










Ala 


Ala 


He 


He 


Leu 


Cys 


Gly 


Asp 






355 










360 


Arg 


Val 


Glu 


Ala 


He 


Gin 


Asp 


Thr 




370 










375 




Leu 


Gin 


Ala 


Asn 


His 


Pro 


Asp 


Ala 


385 










390 




Gin 


Lys 


Met 


Ala 


Asp 


Leu 


Arg 


Gin 



405 



: SEQ ID NO: 7: 



Pro 


Glu 


Val 


Arg 


Glu 


Glu 


Glu 


Glu 




10 










15 




Gly 


Ala 


Pro 


Glu 


Leu 


Asn 


Gly 


Gly 


25 










30 






Ser 


Tyr 


Thr 


Asp 


Leu 


Ser 


Arg 


Ser 










45 








Gin 


Leu 


Gin 


Met 


Gly 


Cys 


Asp Gly 








60 










Glu 


Cys 


Arg 


Val 


Cys 


Gly 


Asp 


Lys 






75 










80 


His 


Ala 


Cys 


Glu 


Gly 


Cys 


Lys 


Gly 




90 










95 




Lys 


Leu 


Glu 


Tyr 


Glu 


Lys 


Cys 


Glu 


105 










110 






Asn 


Arg 


Asn 


Lys 


Cys 


Gin 


Tyr 


Cys 










125 








Gly 


Met 


Ser 


His 


Asn 


Ala 


He 


Arg 








140 








Lys 


Arg 


Lys 


Leu 


Val 


Ala 


Gly 


Leu 






155 










160 


Asn 


Pro 


Gin 


Val 


Ala 


Asp 


Leu 


Lys 




170 










175 




Ala 


Tyr 


Leu 


Lys 


Asn 


Phe 


Asn 


Met 


185 










190 






Leu 


Thr 


Gly 


Lys 


Ala 


Ser 


His 


Thr 










205 








Glu 


Thr 


Leu 


Trp 


Gin 


Ala 


Glu 


Lys 








220 










Asn 


Gly 


Leu 


Pro 


Pro 


Tyr 


Lys 


Glu 






235 










240 


Cys 


Gin 


Cys 


Thr 


Thr 


Val 


Glu 


Thr 




250 










255 




Lys 


Ser 


He 


Pro 


Ser 


Phe 


Ser 


Ser 


265 










270 






Leu 


Leu 


Lys 


Tyr Gly 


Val 


His 


Glu 










285 








lie 


Val 


Asn 


Lys Asp Gly 


Leu 


Leu 








300 










Thr 


Arg 


Glu 


Phe 


Leu 


Arg 


Ser 


Leu 






315 










320 


Glu 


Pro 


Lys 


Phe 


Glu 


Phe 


Ala 


Val 




330 










335 




Asp 


Ser 


Asp 


Leu 


Ala 


Leu 


Phe 


He 


345 










350 






Arg 


Pro 


Gly 


Leu 


Met 


Asn 


Val 


Pro 










365 








He 


Leu 


Arg 


Ala 


Leu 


Glu 


Phe 


His 








380 










Gin 


Tyr 


Leu 


Phe 


Pro 


Lys 


Leu 


Leu 






395 










400 


Leu 


Val 


Thr 


Glu 


His 


Ala 


Gin 


Met 




410 










415 
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Met Gin Arg He Lys Lys Thr Glu Thr Glu Thr Ser Leu His Pro Leu 

420 425 430 

Leu Gin Glu He Tyr Lys Asp Met Tyr 
435 440 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3301 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GAATTCTGCG GAGCCTGCGG GACGGCGGCG GGTTGGCCCG TAGGCAGCCG GGACAGTGTT 60 

GTACAGTGTT TTGGGCATGC ACGTGATACT CACACAGTGG CTTCTGCTCA CCAACAGATG 120 

AAGACAGATG CACCAACGAG GGTCTGGAAT GGTCTGGAGT GGTCTGGAAA GCAGGGTCAG 180 

ATACCCCTGG AAAACTGAAG CCCGTGGAGC AATGATCTCT ACAGGACTGC TTCAAGGCTG 240 

ATGGGAACCA CCCTGTAGAG GTCCATCTGC GTTCAGACCC AGACGATGCC AGAGCTATGA 300 

CTGGGCCTGC AGGTGTGGCG CCGAGGGGAG ATCAGCCATG GAGCAGCCAC AGGAGGAAGC 3 60 

CCCTGAGGTC CGGGAAGAGG AGGAGAAAGA GGAAGTGGCA GAGGCAGAAG GAGCCCCAGA 420 

GCTCAATGGG GGACCACAGC ATGCACTTCC TTCCAGCAGC TACACAGACC TCTCCCGGAG 480 

CTCCTCGCCA CCCTCACTGC TGGACCAACT GCAGATGGGC TGTGACGGGG CCTCATGCGG 540 

CAGCCTCAAC ATGGAGTGCC GGGTGTGCGG GGACAAGGCA TCGGGCTTCC ACTACGGTGT 600 

TCATGCATGT GAGGGGTGCA AGGGCTTCTT CCGTCGTACG ATCCGCATGA AGCTGGAGTA 660 

CGAGAAGTGT GAGCGCAGCT GCAAGATTCA GAAGAAGAAC CGCAACAAGT GCCAGTACTG 720 

CCGCTTCCAG AAGTGCCTGG CACTGGGCAT GTCACACAAC GCTATCCGTT TTGGTCGGAT 780 

GCCGGAGGCT GAGAAGAGGA AGCTGGTGGC AGGGCTGACT GCAAACGAGG GGAGCCAGTA 840 

CAACCCACAG GTGGCCGACC TGAAGGCCTT CTCCAAGCAC ATCTACAATG CCTACCTGAA 900 

AAACTTCAAC ATGACCAAAA AGAAGGCCCG CAGCATCCTC ACCGGCAAAG CCAGCCACAC 960 

GGCGCCCTTT GTGATCCACG ACATCGAGAC ATTGTGGCAG GCAGAGAAGG GGCTGGTGTG 1020 

GAAGCAGTTG GTGAATGGCC TGCCTCCCTA CAAGGAGATC AGCGTGCACG TCTTCTACCG 1080 

CTGCCAGTGC ACCACAGTGG AGACCGTGCG GGAGCTCACT GAGTTCGCCA AGAGCATCCC 1140 

CAGCTTCAGC AGCCTCTTCC TCAACGACCA GGTTACCCTT CTCAAGTATG GCGTGCACGA 1200 

GGCCATCTTC GCCATGCTGG CCTCTATCGT CAACAAGGAC GGGCTGCTGG TAGCCAACGG 1260 

CAGTGGCTTT GTCACCCGTG AGTTCCTGCG CAGCCTCCGC AAACCCTTCA GTGATATCAT 1320 

TGAGCCTAAG TTTGAATTTG CTGTCAAGTT CAACGCCCTG GAACTTGATG ACAGTGACCT 1380 

GGCCCTATTC ATTGCGGCCA TCATTCTGTG TGGAGACCGG CCAGGCCTCA TGAACGTTCC 1440 

ACGGGTGGAG GCTATCCAGG ACACCATCCT GCGTGCCCTC GAATTCCACC TGCAGGCCAA 1500 

CCACCCTGAT GCCCAGTACC TCTTCCCCAA GCTGCTGCAG AAGATGGCTG ACCTGCGGCA 1560 

ACTGGTCACC GAGCACGCCC AGATGATGCA GCGGATCAAG AAGACCGAAA CCGAGACCTC 1620 

GCTGCACCCT CTGCTCCAGG AGATCTACAA GGACATGTAC TAACGGCGGC ACCCAGGCCT 1680 

CCCTGCAGAC TCCAATGGGG CCAGCACTGG AGGGGCCCAC CCACATGACT TTTCCATTGA 1740 

CCAGCTCTCT TCCTGTCTTT GTTGTCTCCC TCTTTCTCAG TTCCTCTTTC TTTTCTAATT 1800 

CCTGTTGCTC TGTTTCTTCC TTTCTGTAGG TTTCTCTCTT CCCTTCTCCC TTCTCCCTTG 1860 

CCCTCCCTTT CTCTCTCCTA TCCCCACGTC TGTCCTCCTT TCTTATTCTG TGAGATGTTT 1920 

TGTATTATTT CACCAGCAGC ATAGAACAGG ACCTCTGCTT TTGCACACCT TTTCCCCAGG 1980 

AGCAGAAGAG AGTGGGCCTG CCCTCTGCCC CATCATTGCA CCTGCAGGCT TAGGTCCTCA 2040 

CTTCTGTCTC CTGTCTTCAG AGCAAAAGAC TTGAGCCATC CAAAGAAACA CTAAGCTCTC 2100 

TGGGCCTGGG TTCCAGGGAA GGCTAAGCAT GGCCTGGACT GACTGCAGCC CCCTATAGTC 2160 

ATGGGGTCCC TGCTGCAAAG GACAGTGGCA GACCCCGGCA GTAGAGCCGA GATGCCTCCC 2220 

CAAGACTGTC ATTGCCCCTC CGATCGTGAG GCCACCCACT GACCCAATGA TCCTCTCCAG 2280 

CAGCACACCT CAGCCCCACT GACACCCAGT GTCCTTCCAT CTTCACACTG GTTTGCCAGG 2340 
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CCAATGTTGC TGATGGCCCC TCCAGCACAC 
AGGCACCATG CACCTCCCTT CCCTCCCTGA 
CAGGTGAGCA GGCAGGGCTG GGCCAGGTCT 
TGGGTCAGCC CAGCACCTCG CCCAGTGGGA 
CTGATGTCCA TTTGTCCCAA TAGCTCTACT 
TGGCCACCTA GAAGTCTCCC TGCACAGCCT 
CCACACCGCT GGTCCCTGCC CTCCCCTGCT 
GGGCCAAAGC ACAGCTGGGC ATGCCATGTC 
GGGCAAGGGG CTGGCTGGAG TCTCAGAGCA 
GGCTTCCTGG GTCCTGCCTG GTCCTCCCTC 
GAAGTGCCCA GCCCCTGCCT GACGGNNNNN 
CGCTCCCCAC CTACCCAGCT GATGGGGGTT 
GGACAGCTGC TGGGACCCAC CTCCCCCCTT 
CCCGGGTCTG GTGCTGAGGA TACAGCTCTT 
ATGTATATTT TTGCTAGGAG CCCCAGCTTC 
ACTGACGAAA CTTTAAATAA ATGGGAATTA 
C 



ACACATAAGC ACTGAAATCA CTTTACCTGC 
GGCAGGTGAG AACCCAGAGA GAGGQGCCTG 
CCGGGGAGGC AGGGGTCCTG CAGGTCCTGG 
GCTTCCCGGG ATAAACTGAG CCTGTTCATT 
GCCCTCCCCT TCCCCTTTAC TCAGCCCAGC 
CTAGTGTCCG GGGACCTTGT GGGACCAGTC 
CCCAGGTTGA GGTGCGCTCA CCTCAGAGCA 
TGAGCGGCGC AGAGCCCTCC AGGCCTGCAG 
CAGAGGTAGG AGAACTGGGG TTCAAGCCCA 
CCAAGGAGCC ATTCTATGTG ACTCTGGGTG 
NNGATCACTC TCTGCTGGCA GGATTCTTCC 
GGGGTGCTTC TTTCAGCCAA GGCTATGAAG 
CCCCGGCCAC ATGCCGCGTC CCTGCCCCCA 
CTCAGTGTCT GAACAATCTC CAAAATTGAA 
CTGTGTTTTT AATATAAATA GTGTACACAG 
AATATTTAAA AAAAAAAGCG GCCGCGAATT 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3301 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
ACTCGGATCC AAGCCATGGC TGAGAACTTG CTGGACGG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CACAAAGCTT AGGCCATGTT AGCACTGTTC GG 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CTCAGTCGAC TTATTGAATT CCACTAGCTG GAGATCC 37 
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